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A rKNOWT .EDGMENT 

This invention was made with Government support under Grant No. POl AG- 
10435 awarded by the National Institute of Aging. The U. S. Federal Government has 
5 certain rights in the invention. 

FTETn OF THE INVENTION 

The present invention relates to methods in the field of recombinant DNA 
technology, and products related thereto. In a particiilar aspect, the invention relates to 
methods for modulating the expression of exogenous genes in mammalian or non- 
1 0 mammalian systems, and products useful therefor. 

BACKGROUND OF THE INVENTION 

It is known in the art to produce fusion proteuis for a number of purposes. In 
some cases, the two protein units in the single polypeptide have two essentially 
independent activities. The most common example of this application is the fusion of 

15 marking proteins, such as GFP, to intracellular factors as a means of observing their 
localization and expression (see, for example, A.W. Kerrebrock et al, Cell, 83:247- 
56, 1995; H.G. Wang et al. Cell, 87:629-638, 1996). Creation of fusion proteins has 
also been used to prolong the half-life of a protein (see, for example, R.A. Hallewell, 
et al, J. Biol Chem., 264:5260-5268, 1989; T.P. Yao et al. Cell, 77:6372, 1992) as 

20 well as other uses (see, for example, T. Sano et al, Proc Natl Acad Sci U.S.A. , 
89:1534-1538, 1992). 

A more complicated application of protein fusion is the production of fusion 
proteins wherein the two protein units cooperate to achieve a biological function. In 
functional dimers, both proteuis must fold and interact with each other appropriately. 
25 V.A. Garcia-Campayo et al. {Nature Biotech, 15:663-667, (1997)) have utilized a 
peptide linker to fuse gene subunits together into a single biologically active peptide. 
Neuhold and Wold, (Cell, 74:1033-1042, (1993)) have reported the fusion of two 



2 



proteins into a single biologically active protein that binds DNA targets, wherein the 
protein units interact with each other to the exclusion of competing heterodimer 
partners. However, fusion of proteins with multiple functions has been more difficult 
to produce, for example, steroid/thyroid hormone nuclear receptors are complex, 
5 multifiinctional proteins with, minimally, four interconnected yet separable functions: 
ligand binding, dimerization, DNA binding, and transactivation. 

Steroid/thyroid hormone nuclear receptors are used in the field of genetic 
engineermg as a tool for studying control of gene expression and to manipulate and 
control development and other physiological processes. For example, apphcations for 
10 regulated gene expression in mammalian systems include inducible gene targeting, 
overexpression of toxic and teratogenic genes, anti-sense RNA expression, and gene 
therapy (see, for example, R. Jaenisch, Science 240:1468-1474, 1988). For cultured 
cells, glucocorticoids and other steroids have been used to induce the expression of a 
desired gene. 

1 5 As anotiier means for controlling gene expression in mammalian systems, an 

inducible tetracycline regulated system has been devised and utilized in transgenic mice, 
whereby gene activity is induced in the absence of tetracycline and repressed in its 
presence (see, e.g, Gossen et al PiV^S" 89:5547-5551,1992; Gossen etal, TIBSHAH- 
475, 1993; Furth et al., PNAS 91:9302-9306, 1994; and Shockett et al, PNAS 

20 92:6522-6526, 1995). However, disadvantages of the mducible tetracycline system 
include the requirement for continuous administration of tetracycline to repress 
expression and the slow clearance of antibiotic from bone, a side-effect that mterferes 
with regulation of gene expression. While this system has been improved by the recent 
identification of a mutant tetracycUne repressor that acts conversely as an inducible 

25 activator, the pharmacokinetics of tetracycline may hinder its use during development 
when a precise and efficient "on-ofE" switch is essential (see, e.g., Gossen et al. Science 
268:1766-1769, 1995). 

Certain insect steroid/thyroid hormone nuclear receptors have also been studied. 
The Drosophila melanogaster ecdysone receptor (EcR) (M. R. Koelle et al, Cell 
30 67:59-77, 1995) is unlike the estrogen, androgen, and other homodimeric vertebrate 
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steroid hormone nuclear receptors because it requires a heterologous dimer partner for 
functional transactivation. The obligate dimer partner, the product of the 
ultraspirade (Usp) gene (V. C. Henrich et al, Nuc. Acids Res. 18: 4143-4148, 1990; 
T. P. Yao et al, supra, 1992; T. P. Yao etal, Nature 366:476-479, 1993), is an insect 

5 homolog of the mammalian retinoid X receptor (RXR) proteins found in vertebrates 
and other mammahan species. RXRs have been characterized as regulatory dimer 
partners of many mammalian class II steroid/thyroid hormone nuclear receptors, such 
as the thyroid hormone receptors, the retinoic acid receptors, and the vitamin D 
receptor (reviewed in Mangelsdorf and Evans, Cell 83:841-850, 1995; D. J. 

10 Mangelsdorf al, Cell 83: 835-839, 1995). RXR is also a dimer partner of EcR. 

Usp and RXR share a significant degree of sequence homology and some 
functional similarities; however, in formation of heterodimers with EcR, RXR 
interacts differently than Usp. One primary difference is that formation of EcR+RXR 
heterodimers is more highly stimulated by the steroid ligand ecdysteroid muristerone 

15 A (murA) than by 20-hydroxyecdysone (20-Ec), while formation of EcR+Usp 

heterodimers is potently stimulated by 20-hydroxyecdysone (K. S.Christopherson et 
al, Proa Natl Acad Sci USA 89:6314-6318, 1982; H. E. Thomas etal. Nature 
352:471-475, 1993). A second difference is in the way that ligand promotes efficient 
formation of EcR+Usp and EcR+RXR heterodimer complexes and concomitant 

20 binding to ecdysone response elements (EcREs). MurA stimulates EcR+Usp binding 
of EcREs approximately 3 to 7-fold over levels without ligand, but EcR+RXR 
complexes are completely dependent on ligand for heterodimerization. Further 
EcR+RXR complexes bind to EcREs at only 10-40% the level of EcR+Usp 
complexes (Christopherson et al, supra 1982; Thomas et al, supra 1993; Yao et al, 

25 supra, 1992 & 1993). This suggests that the affmity of EcR for its natural dimer 
partner, Usp, is significantly greater than its affinity for RXR. 

EcR has been studied for use in transgene regulation; however, its use for this 
purpose is complicated by the requirement for superphysiological levels of RXR 
protein to be coexpressed (No et al, supra 1997), presumably because of the 
30 comparatively low affinity of EcR for RXR as a dimer partner. Of the mammalian 
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cell types heretofore examined, only the 293 cell line appears capable of supporting 
high level transactivation of EcR without added RXR (Christopherson et al, supra, 
1982). The requirement for co-expression of RXR in most mammalian systems raises 
concerns that RXR will heterodimerize with endogenous mammalian class II 
5 steroid/thyroid hormone nuclear receptors, causing altered differentiation, growth, or 
fitness of transduced cells. 

A number of ecdysone receptors are known in the art as being a gene sequence 
responsive to an applied exogenous chemical inducer enabling external control of 
expression of the gene controlled by the receptor (See, for example, 
10 PCT/GB96/01195). 

Accordingly, there is a need in the art for improved systems to precisely 
modulate the expression of exogenous genes in mammalian subjects. For example, a 
non-mammalian-based transcription regulating system would be extremely desirable for 
general application to transgene regulation in in vitro, ex vivo, and in vivo applications. 
15 In addition, there is a need in the art for new and better methods of using 

steroid/thyroid hormone nuclear receptors that require a dimer partner for functional 
transactivation of transgene expression for use in somatic gene therapy and for 
laboratory models thereof. 

BRIEF DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention, there are provided chimeric proteins 

comprising at least two functional protein units, wherein each functional protein unit 
comprises the dimerization domain of a member of the steroid/thyroid hormone 
nuclear receptor superfamily, and an optional linker interposed therebetween, wherein 
the at least two protein units form a functional entity. When the chimeric protein 

25 contains two functional protein units, the chimeric protein forms a functional dimer 
(FD), for example a heterodimer or a homodimer. In one embodiment according to 
the present invention, each protein unit comprises a ligand binding domain and an 
optional hinge domain of a steroid/thyroid hormone nuclear receptor member, and an 
optional DNA binding domain. The functionality of the entity is independent of the 
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order of the protein units in the chimeric protein. Polynucleotides encoding the 
invention chimeric protein and cells containing such polynucleotide(s) are also 
provided according to the present invention. In one embodiment according to the 
present invention, the invention polynucleotide encodes the invention chimeric 
5 protein as a fusion protein, with one or more linker(s) encoded as a polypeptide linker. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating the expression of exogenous gene(s) in a subject 
organism containmg DNA construct(s) encoding and expressing invention chimeric 
protein(s) and DNA construct(s) encoding and expressing exogenous gene(s) under 
10 the control of a response element. The invention method for modulating the 

expression of exogenous gene(s) in a subject organism comprises administering to the 
subject an effective amount of an exogenous ligand for at least one functional unit of 
the chimeric protein. 

The present DNA binding studies indicate that many of the invention 
15 functional dimers (FDs) display DNA binding equivalent or superior to that of 

receptor complexes formed from and/or containing identical wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily (i.e., the same two members 
from which the invention chimeric protein is derived). Transient transfection analysis 
reveals that distinct groups of FD constructs transactivate responsive promoters in a 
20 manner similar to wild-type complexes, while others lose the capacity to transactivate 
and function like constitutive repressors. 

Competition experiments and supporting data reveal that FDs favor 
dimerization with duner partners contained within a chimeric protein over interaction 
with other wild type dimer partners. These results demonstrate that certain of the 
25 invention chimeric protein FDs share properties of monomeric receptor complexes 
while others have novel characteristics iinique to individual constructs. 



To enhance the possibility of producing a functional entity upon expression, 
the invention chimeric proteins allow for any of the protein units to be positioned at 
the amino terminus of the chimeric protein. In addition, to enhance flexibility for 
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proper folding and three-dimensional orientation of the protein units into a functional 
entity, an optional linker can be interposed between the protein imits in the chimeric 
protein. A variety of different linkers can be used in the invention chimeric proteins, 
including chemical and polypeptide linkers, with the latter being preferred if the entity 
5 is expressed as a fusion protein. In a presently preferred embodiment, the linker is 
designed to allow for incremental elongation of the linker distance interposed between 
the two protein units. 

BRIEF DESCRIPTION OF THE FIGURES 

In the interests of brevity and consistency, the names of receptors and dimer 

10 partners have been abbreviated for use herein as follows: "E", "U", or "R" alone 
indicates a monomeric receptor protein or dimer partner (i.e., not contained in an 
invention chimeric protein) containing, respectively, at least the ligand binding 
domain of the Drosophila ecdysone receptor, the ultraspiracle protein, or the retinoid 
X receptor. When contained within an invention "fusion protein", which is 

15 alternatively referred to herein as a "functional dimer", these receptor proteins are 
represented by "E", "U", or "R" separated by either an "N", representing a linker of 
any length, or a nvimeral from 0 to 20, indicative of a linker containing a specific 
number of linker segments wherein each linker segment contains 12 amino acids. In 
the description of invention fusion proteins, which are functional dimers, the leading 

20 letter in the abbreviation indicates the receptor protein at the amino terminus of the 
fusion protein. For example, E5U means a fusion protein having at least the ligand 
binding domain, hinge domain, and optionally functional DNA binding domain of 
Drosophila ecdysone receptor at the amino terminus, a linker containing 5 linker 
segments (of 12 amino acids each, plus the 5 amino acid linker bridge (i.e., a linker 

25 containing a total of 65 amino acids) and the comparable domains of the ultraspiracle 
protein. An initial "V" in the construct abbreviation, e.g., VE5U or VE, indicates 
fusion of the VP16 x activation domain to the N-terminus of the fusion protein, as 
described more completely hereafter. 

Figure 1 is a schematic diagram of an nucleic acid construct encoding 
30 invention fusion proteins that contain EcR (darkly shaded) with a dimer partner, U 
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(Usp) or R (RXR) (darkly shaded). "D" = DNA binding domain; "L" = ligand 
binding domain; curvilinear line = fusion bridge. "Individual" represents a nucleotide 
sequence that encodes the wild type C terminus of EcR (receptor) and the monomeric 
N-terminus of RXR (binding partner) before introduction of a nucleotide sequence 
5 encoding a fusion bridge. "Fused" represents the same segments with nucleotides 
inserted that encode a 5 amino acid fusion bridge containing the Sfil insertion site. 
"Tether" indicates a nucleotide sequence that encodes a 12 amino acid linker to be 
inserted into the Sfil site of the fusion bridge to produce fusion proteins with greater 
spacing between the two protein units (i.e., dimer partners) in the invention fusion 
10 protein. 

Figures 2A-B illustrate the results of gel mobility shift assays of response 
element binding of the invention FD constructs having linkers containing either 0 or 5 
linker segments as compared with that of monomeric receptor complexes (translated 
in vitro). 

15 Figure 2 A is a graph quantifying gel mobility shift as a result of response 

element binding to invention endodimer FDs in the presence of murA. Controls were 
treated either with vehicle (open bars) or with 1 mvxA as ligand (black bars). Bars 
are labeled along the bottom with FDs named as described in the text. E represents an 
EcR only control; NON represents a non-transfected control. E+U and E+R are 

20 control lanes of monomeric in vitro translated proteins used for sizing of endodimer 
band shifts. Numbers at the top of each bar represent relative-fold increase in 
response element binding resulting from ligand treatment. 

Figure 2B is a schematic representation of five F-domain deletion constructs 
containing EcR (darkly shaded) and RXR (lightly shaded) with no linker polypeptide 
25 (EOR). Incremental deletions are shown to Nhel, PvuII, Narl, and Bglll sites within 
the ecdysone receptor F domain. The top schematic represents EOR (1340 amino 
acids) containing the complete F-domain (bracketed); the second schematic represents 
a deletion (A60 amino acids) to the Nhel site; the third schematic represents a deletion 
(A138 amino acids) to the PvuII site; the fourth schematic represents a deletion (A198 
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amino acids) to the Narl site; and the fifth schematic represents a deletion (A228 
amino acids) to the Bglll site. 

Figure 3 is a graph showing relative luciferase expression induced by FD 
constructs with or without ligand as determined in transient transfection assays for 
5 FDs and for monomeric EcR with either Usp or RXR when treated with vehicle (open 
bars) or 1|liM murA. Decimal numbers on the abscissa represent molar amount of FD 
relative to VE plasmid (1.0 is equimolar FD:VE). EOU and UOE without VE 
cotransfection are at the extreme right of the bar graph. See also Table 1 . 

Figures 4A-B are two graphs illustrating the results of transient transfection 
1 0 assays conducted using either VP 1 6-fused monomeric receptors or invention fusion 
protein FDs with increasing linker lengths. Figure 4A is a graph showing a 
comparison of luciferase activity in relative light units (RLU) in transient transfection 
assays conducted with or without ligand, using either monomeric receptors having 
amino terminal fiised VP 16 activation domains or invention FDs containing EcR, 
1 5 RXR, and a linker with a variable number of linker segments. Cells were treated with 
vehicle (open bars), or 1 p,M muristerone A as ligand (black bars). Numbers at the 
top of the bars indicate the fold-increase relative to FD or monomeric receptor 
without addition of monomeric VRXR (VR) or monomeric VUsp (VU). E = EcR 
only; E4-luc = reporter plasmid only; and Figure 4B is a graph showing a comparison 
20 of luciferase activity as in Figure 4A herein, except that the FDs contain Usp in place 
of RXR. 

Figure 5 is a series of three graphs showing repression of ligand-stimulated 
luciferase expression by monomeric receptors caused by competition with the 
invention ENU and UNE FDs when transiently co-transfected into 293 cells and 

25 treated either with vehicle (open bars), or 1 muristerone A as ligand (black bars). 
Decimal numbers on the abscissa represent molar amount of FD relative to VE 
plasmid (1 .0 is equimolar FD:VE). Figure 5A shows a comparison of the inhibitory 
effects of EOU and UOE on ligand-stimulated expression of luciferase by monomeric 
VE in combination with endogenous RXR. EOU and UOE without VE cotransfection 

30 are at the extreme right of the bar graph; Figure 5B shows the effects of monomeric 



9 



EcR (without VP 16 fusion) on ligand-stimulated expression of luciferase in the assay 
of Figure 5 A by competition with EOU or UOE; and Figure 5C shows the effects of 
monomeric EcR on luciferase expression in the presence of ligand in the assay of 
Figure 5B, as compared with VE combined with monomeric exogenous Usp. 

5 Figure 6 is a graph showing a comparison of results (in RLU) obtained in 

assays in which E5U and E5R compete with monomeric VRXR (VR) or monomeric 
VUsp (VU) in 

the presence of vehicle only (open bars) or murA as ligand (black bars). FDs and 
receptor combinations are labeled along the abscissa. Nimibers above the bars 
10 represent the fold-increase relative to FD or receptor without addition of VR or VU. 
E4LUC at the extreme right is reporter plasmid alone as control. 

Figures 7A-E are a series of six schematic diagrams representing possible 
conformations of receptor FDs described in the text. Shaded and white 
oval/rectangles represent receptors, small rectangles with interior arrows represent 
1 5 EcREs, and curvilinear lines represent linkers between protein units in the invention 
fusion proteins. Figure 7 A represents a native dimer; Figure 7B represents a 
disorganized fusion protein; Figure 7C represents an endodimer orientation of a 
single invention FD; Figure 7D represents a tetramer of two invention FDs; Figure 
7E represents a multimer of four invention FDs. 

20 Figure 8 is a series of schematic representations of invention FDs containing 

Bombyx ecdysone receptor (BEcR) plus the entire F domain of the Drosophila 
melanogaster ecdysone receptor (amino acids 650 to 878) (DE), which segment is 
included for ease in making the construct (DEcR). The BEcR is at the amino 
terminus of the fusion protein with either RXR or Usp as the dimer partner at the 

25 carboxy terminus of the fusion protein. "D" = DNA binding domain; "L" = ligand 
binding domain; curvilinear line = fusion bridge. "H" = an N-terminal His tag for 
protein purification. 



10 



DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are provided chimeric proteins 
comprising at least two ftinctional protein xinits, wherein each functional protein unit 
5 comprises the dimerization domain of a member of the steroid/thyroid hormone 

nuclear receptor superfamily, and an optional linker interposed therebetween, wherein 
the at least two protein units form a functional entity. When the chimeric protein 
contains two functional protein units, the chimeric protein forms a functional dimer 
(FD), for example a heterodimer or a homodimer. 

10 The invention chimeric proteins form functional entities (e.g. functional 

dimers) under a variety of conditions. Such conditions include, but are not hmited to, 
those at or near physiological conditions (e.g., in saline at body temperature). Those 
of skill in the art will understand that formation of invention functional entities by 
dimerization or crystallization of a macromolecule can be influenced by manipulation 

15 of a variety of physical parameters, such as are disclosed in McPherson, Eur. J. 

Biochem., 189 :1-23. 1990, which is incorporated herein by reference in its entirety. 
Due to the proximity of the protein units within the invention chimeric protein, 
dimerization tends to take place with intramolecular partners, rather than with other 
suitable monomeric dimer partners with which the protein units in the chimeric 

20 protein might otherwise interact. 

As used herein, plural nouns and verbs are intended to signify the singular 
form as well as the plural form of the particular noun or verb, unless prefixed by an 
adjective indicating a specific number, such as "two feet" or "three ligands", and a 
singular noim or verb is intended to include the plural form, unless prefixed by a 
25 phrase clearly indicating that only the singular noun or verb is intended, as in the 
phrase "one and only one foot" or "only one ligand." 



Each chimeric protein in the invention system is required to contain a 
dimerization domain of a member of the steroid/thyroid hormone nuclear receptor 
superfamily. As used herein, "dimerization domain" means a region of a member of 
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the steroid/thyroid hormone nuclear receptor superfamily containing a sequence of 
amino acids that functions to cause dimerization of two members of the 
steroid/thyroid hormone nuclear receptor superfamily. Members of the steroid/thyroid 
hormone nuclear receptor superfamily are commonly characterized by the presence of 
5 five domains: N-terminal or activation domain (A/B), DNA binding domain (C), hinge 
domain (D), ligand binding domain (E), and C-terminal domain (F) (Evans, R. Science 
240:889-895, 1988). The dimerization domain is generally located w^ithin the region 
of the receptor molecule that is referred to as including the D, E and F domains, or is 
referred to as the "D-E-F" domain. Typically the dimerization domain includes the 

10 complete ligand binding domain (E) and may optionally include all or part of the 

hinge domain (D) and/or the C-terminal region (F) of a member of the steroid/thyroid 
nuclear receptor superfamily, or a functional equivalent thereof. In some cases the 
dimerization domain may include at least a portion of the DNA binding domain itself 
Multiple domains of a given receptor can act in concert as v^ell as independently. 

1 5 Therefore, as employed herein, the term "dimerization domain of a member of the 
steroid/thyroid hormone nuclear receptor superfamily" refers to that portion (or 
portions) of a member of the steroid/thyroid hormone nuclear receptor superfamily 
that is involved in the formation of a dimer. 

As used herein, the term "fusion protein" means a genetically engineered 
20 molecule in v^hich two or more polypeptide imits are fused into a single polypeptide 
molecule by fusion of the open reading fi-ames (ORFs) encoding the two or more 
separate protein units into a single ORF. The invention fusion proteins are capable of 
forming a "functional entity" in the optional presence of ligand. When the fusion 
proteins contain two protein units, a "functional dimer (FD)" is formed by 
25 dimerization. 

As used herein, the term "functional dimer" or "functional entity" as applied 
to an invention chimeric protein means that the functional entity or dimer possesses at 
least some of the biological function of a dimer formed between two equivalent 
monomeric (i.e. imdimerized) polypeptide imits, or between two equivalent 
30 monomeric members of the steroid/thyroid hormone nuclear receptor superfamily. 
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The biological function of such dimers includes one or more of the following 
properties: DNA binding, ligand binding, transactivation, and dimerization properties 
related to transactivation of a promoter operatively associated with a response element 
responsive to the invention chimeric protein. For example, invention chimeric 
5 protein(s) can modulate transactivation of gene(s) whose expression is controlled by the 
presence of hgand (e.g. an invention FD wherein at least one member is a Bombyx mori 
ecdysone receptor can modulate the expression of a gene under the control of a Bombyx 
ecdysone response element). 

Therefore, the term "functional protein units" as applied to the functional 
10 entity or dimer formed by an invention chimeric protein means that the at least two 
protein units in the functional entity or dimer possess a cooperative function. For 
example, in a functional dimer the two dimerization domains (e.g., the two protein 
units) fold and interact with each other in a manner appropriate to substantially 
preserve one or more of the above named biological functions in the functional dimer 
15 that are present when corresponding monomeric members of the dimer come together 
under physiological conditions to form a native dimer complex. 

As used herein the term "endodimer" means a dimer formed in an orientation 
approximating that of a native dimer complex formed between equivalent monomeric 
polypeptides, i.e., an "internal" dimer. Figure 7 A illustrates a native dimer complex 
20 and Figure 7C illustrates an invention endodimer. 

As used herein the term "dimer partner" means any polypeptide that, xmder 
physiological conditions, forms a dimer with a member of the steroid/thyroid 
hormone nuclear receptor superfamily. Such dimer partners include, but are not 
limited to, monomeric member(s) of the steroid/thyroid hormone nuclear receptor 
25 superfamily, including those known in the art as a "silent partner," which are 
characterized by forming dimeric species with a member of the steroid/thyroid 
superfamily of receptors wherein the silent partner may not directly participate in 
binding ligand (i.e., only the co-partner in the fusion protein binds ligand). Exemplary 
dimer partner(s) include RXR, Usp, Nurrl, and the like. 
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The term "dimer partner" is meant to include members of the steroid/thyroid 
hormone nuclear receptor superfamily to which other wild type members preferentially 
bind to form heterodimeric species. For example, wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily preferentially form heterodimers 
5 with a common partner, the retinoid X (or 9-cis retmoic acid) receptor (RXR, see, for 
example, Yu etal, Cell, 67:1251-1266, 1991; Bugge etal, EMBOJ., 11:1409-1418, 
1992; iaiewererfl/.,A/afwre 355:446-449, 1992; Leidetal, Cell 68:377-395, 1992; 
Marks etal, EMBOJ. 11:1419-1435, 1992; Zhang etal. Nature 355:441-446, 1992; 
Issemann et al, Biochimie, 75:251-256, 1993). Additional dimer partners for members 
10 of the steroid/thyroid hormone nuclear receptor superfamily include ultraspiracle (Usp), 
famesoid X receptor (FXR), and the like. 

As used herein, the phrase "member(s) of the steroid/thyroid hormone nuclear 
receptor superfamily" (also known as "intracellular receptors" or "the nuclear receptor 
superfamily") refers to hormone binding proteins that operate as Ugand-dependent 
1 5 transcription factors, including identified members of the steroid/thyroid hormone 
nuclear receptor superfamily for which specific ligands have not yet been identified 
(referred to in the art as "orphan receptors"). 

Exemplary members of the steroid/thyroid hormone superfamily of receptors 
(including the various isoforms thereof) include steroid receptors such as 

20 glucocorticoid receptor (GR), mineralocorticoid receptor (MR), estrogen receptor 
(ER), progesterone receptor (PR), androgen receptor (AR), vitamin D3 receptor 
(VDR), and the like; plus retinoid receptors, such as the various isoforms of retinoic 
acid receptor (e.g., RARa, RARP or RARy)» the various isoforms of retinoid X (or 9- 
cis retinoic acid) receptor (e.g., RXRa, RXRP, or RXRy), various isoforms of 

25 peroxisome proliferator-activated receptors (e.g., PPARa, PPARy, PPAR6) and the 
like (see, e.g., U.S. Patent Nos. 4,981,784; 5,171,671; and 5,071,773); thyroid 
hormone receptor (T3R), such as TRa, TRP, and the like; steroid and xenobiotic 
receptor (SXR, see for example, Blumberg et al.. Genes Dev (1998) 
12(20):3 195-205), RXR-interacting proteins (RIPs; see, e.g., Seol et al., Mol 

30 Endocrinol (1995) 9ll}:72-85; Zavacki et al., Proc Natl Acad Sci USA (1997) 
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94115}:7909-14) including famesoid X receptor (FXR; see for example, Forman et al.. 
Cell (1995) 81i5}:687-93; Hanley et al, J Clin Invest (1997) 100(3:^ :705-12. O'Brien 
et al., Carcinogenesis (1996) I7{2): 185-90), pregnenolone X receptor (PXR; see for 
example, Schuetz et al., Mol Pharmacol (1998) 54(6;) : 11 13-7), liver X receptor (LXR, 
5 see, e.g., Peet et al., Curr Opin Genet Dev (1 998) 8(5):571-5), BXR (Blumberg et al., 
Genes Dev (1998) 12(9): 1269-77), insect derived receptors such as the ecdysone 
receptor (EcR), the ultraspiracle receptor (see, for example, Oro et al., in Nature 
347:298-301 (1990)), and the like; as well as other gene products which, by their 
structure and properties, are considered to be members of the superfamily, as defined 
10 hereinabove, including the various isoforms thereof (see, e.g., Laudet, V., J Mol 
Endocrinol (1997) 19a}:207-26). 

Examples of orphan receptors contemplated for use herein include HNF4 (see, 
for example, Sladek et al.. Genes & Development 4:2353-2365 (1990)), the COUP 
family of receptors (see, for example, Miyajima et al., in Nucleic Acids Research 

15 16:1 1057-1 1074 (1988), and Wang et al.. Nature 34Q:163-166 (1989)), COUP-like 
receptors and COUP homologs, such as those described by Mlodzik et al.. Cell 
60:211-224 (1990) and Ladias et al.. Science 251:561-565 (1991), orphan receptor 
(ORl; see, e.g., Feltkamp et al, J Biol Chem (1999) 274(15): 1042 1-9), the insect 
derived knirps and knirps-related receptors, short heterodimer partner (SHP; see, e.g., 

20 Seol et al., Mol Cell Biol (1997) 17£12):7 126-31), hepatocyte nuclear receptor 4 
(HNF4), constitutive androstane receptor (CAR; see, e.g., Forman et al.. Nature 
(1998) 395(6702^ :612-51 and the like. 

Each protein unit in the invention chimeric protein is required to contain at least 
a dimerization domain, optionally, the entire ligand binding domain, an optional hinge 

25 domain, and an optionally ftmctional DNA binding domain of a member of the 

steroid/thyroid nuclear receptor superfamily, or a functional equivalent thereof For 
use in the invention methods for modulating the transcription of exogenous or 
endogenous nucleic acids in a host, the hgand binding domains are either endogenous or 
non-endogenous to a host, with the latter including ligand binding domains that are 

30 modified to be non-responsive to ligands endogenous or native to the host. In 
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embodiments wherein the hgand binding domain is derived from non-mammalian 
member(s) of the steroid/thyroid hormone nuclear receptor superfamily, which members 
are not normally present in the cells of a host, the ligand binding domains are preferably 
derived from Ihe carboxy-terminal portion of non-mammaHan members. Exemplary 
5 members that are not normally present in mammalian cells include insect, avian, 

amphibian, reptilian, fish, plant, bacteria, viral and ftingal (including yeast) members of 
the steroid/thyroid hormone nuclear receptor superfamily, and the like. 

Exemplary Ugand binding domains derived from insect receptors include those 
derived from lepidopteran species such as Drosophila melanogaster (M.R. Koelle, 
1995), Bombyx mori (Swevers et al, Insect Biochem. Molec. Biol, 25(7):857-866, 
1995), Choristoneurafumiferana (Palli et al. Insect Biochem. Molec. Biol, 26(5):485- 
499, 1996), Manduca sexta (¥\x]VNQ3LdL et al. Insect Biochem. Molec. Biol, 25(7):845- 
856, 1995\ Aedes aegypti (Cho etal, Insect Biochem Molec. Biol, 25:19-27, 1995), 
Chorinomus tentans (Imhof etal. Insect Biochem. Molec. Biol, 25:115-124, 1993), and 
the like. 

When the fimctional protein units included in the invention chimeric protein lack 
a substantial portion of the C-terminal "F" domain in the dimerization domain of a native 
member of the steroid/thyroid hormone nuclear receptor superfamily, a fimctional 
protein unit that is less than about 700 amino acids in length is provided. 

Ligand binding domains can be fianctionally located in either orientation and at 
various positions within the protein unit. For example, the ligand binding domain can be 
positioned at either the amino or carboxy terminus of the protein unit in the invention 
chimeric protein, or therebetween. In a preferred embodiment of the present invention, 
25 the ligand binding domain is positioned at the carboxy terminus of the protein unit (see 
Figure 1). 

The optional hmge region, when present, can also be fimctionally located in 
either orientation and at various positions within the protein unit. For example, the hinge 
region can be positioned at either the amino or carboxy terminus of the protein unit, or 
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therebetween. Preferably, the hinge region is positioned internally between the ligand 
binding and DNA binding domains of one or more of the members in the chimeric 
protein. The hinge region bounded by the ligand binding domain and DNA binding 
domain of the native Bombyx mori receptor (BEcR), specifically, about 27 amino acid 
5 residues (i.e. amino acid residues 283-309, in the hinge region of BEcR) are sufficient to 
confer high affinity for complex formation with an endogenous dimer partner (see U.S. 
Patent Application Serial No. 08/891,298, filed July 10, 1997, copending herewith). 

Each protein unit in the invention chimeric protein also optionally contains a 
DNA binding-domain. DNA-binding domains contemplated for use in the preparation 

10 of invention chimeric proteins are well known in the art and are typically obtained firom 
DNA-binding proteins (e.g., transcription factors). The term "DNA-binding domam" is 
understood in the art to refer to an amino acid sequence that is able to bind to DNA. As 
used herein, the term "DNA-binding domain" encompasses a minimal peptide sequence 
of a DNA-binding protein up to the entire length of a DNA-binding protein, so long as 

1 5 the DNA-binding domain functions to associate with a particular regulatory element. 

DNA-binding domains are known to function heterologously in combination 
with other functional domains by maintaining the ability to bind the natural DNA 
recognition sequence (see, e.g., Brent and Ptashne, Cell, 43:729-736, 1985). For 
example, with respect to steroid/thyroid hormone nuclear receptors, DNA-binding 

20 domains are interchangeable, thereby providing numerous chimeric receptor proteins 

(see, e.g., U.S. Patent 4,981,784; and R. Evans, Science, 240:889-895, 1988). Similar to 
the ligand binding domain, the DNA-binding domain can be positioned at either the 
carboxy terminus or the amino terminus of a protein unit in tiie invention chimeric 
protein, or the DNA-binding domain can be positioned between the ligand binding 

25 domain and the activation domain. In preferred embodiments of the present invention, 
the DNA-binding domain is positioned intemally between the ligand binding domain 
and the activation domain, 

"DNA-binding proteui(s)" contemplated for use herein belong to the well-known 
class of proteins that are able to directly bind DNA and facilitate initiation or repression 
30 of transcription. Exemplary DNA-binding proteins contemplated for use herein include 
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transcription control proteins (e.g., transcription factors and the like; see, for example, 
Conaway and Conaway, Transcription Mechanisms and Regulation, Raven Press Series 
on Molecular and Cellular Biology, Vol. 3, Raven Press, Ltd., New York, NY, 1994). 

Transcription factors contemplated for use herein as a source of such DNA 
5 binding domains include, e.g., homeobox proteins, zinc finger proteins, hormone 

receptors, helix-tum-helix proteins, helix-loop-helix proteins, basic-Zip proteins (bZip), 
P-ribbon factors, and the like. See, for example, S. Harrison, "A Structural Taxonomy of 
DNA-binding Domains," Nature, 353:715-719. Homeobox DNA-binding proteins 
suitable for use herein include, for example, HOX, STF-1 (Leonard et al, Mol. Endo., 

10 7:1275-1283, 1993), Antp, Mat a-2, INV, and the like. See, also, Scott et al. Biochem. 
Biophys. Acta, 989 :25-48. 1989. It has been found that a fi-agment of 76 amino acids 
(corresponding to amino acids 140-21 5 described in Leonard et al , 1993) containing the 
STF-1 homeodomain binds DNA as tightly as wild-type STF-1 . Suitable zinc finger 
DNA-binding proteins for use herein include Zi£268, GLl, XFin, and the like. See also, 

1 5 Klug and Rhodes, Trends Biochem. Sci. , 12:464, 1 987; Jacobs and Michaels, New Biol. , 
2:583, 1990; and Jacobs, 11:4507-4517, 1992. 

An additional DNA binding domain contemplated for use in the practice of the 
present invention is the GAL4 DNA binding domain. The DNA binding domain of the 
yeast GAL4 protein comprises at least the first 74 amino terminal amino acids thereof 
20 (see, for example, Keegan et al. , Science 231:699-704, 1 986). Preferably, the first 90 or 
more amino terminal amino acids of the GAL4 protein will be used, for example, the 
147 amino terminal amino acid residues of yeast GAL4 . 

The DNA-binding domain(s) used in the invention chimeric proteins can be 
obtained from a member of the steroid/thyroid hormone nuclear receptor superfamily, or 
25 are substantially the same as those obtained fi-om a member of the superfamily. The 
DNA-binding domains of all members of the steroid/thyroid hormone nuclear receptor 
superfamily are related. Such domains consist of 66-68 amino acid residues, and possess 
about 20 invariant amino acid residues, including nine cysteines. Members of the 
superfamily are characterized as proteins which contain these 20 invariant amino acid 
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residues. The highly conserved amino acids of the DNA-binding domain of members of 
the superfamily are as follows: 

Cys-X-X-Cys-X-X-Asp*-X-Ala*-X-Gly*- 
X - Tyr* - X - X - X - X - Cys - X - X - Cys - Lys* - X - 
Phe-Phe-X-Arg*-X-X-X-X-X-(X-X-)Cys- 
X-X-X-X-X-(X-X-X-)Cys-X-X-X-Lys-X- 
X - Arg - X - X - Cys - X - X - Cys - Arg* - X - X - 
Lys* - Cys - X - X - X - Gly* - Met (SEQ ID N0:1); 

wherein X designates non-conserved amino acids within the DNA-binding domain; an 
asterisk denotes the amino acid residues which are ahnost universally conserved, but for 
which variations have been found in some identified hormone receptors; and the residues 
enclosed in parenthesis are optional residues (thus, the DNA-binding domain is a 
minimum of 66 amino acids in length, but can contain several additional residues). 

Invention chimeric proteins are optionally modified by the introduction of an 
activation domain subunit. Activation domains contemplated for use in the practice of 
the present invention are well known in the art and can readily be identified by those of 
skill in the art. Such activation domains are typically derived fi-om transcription factors 
and comprise a contiguous sequence that functions to activate gene expression when 
associated with a suitable DNA-binding domain and a suitable Ugand binding domain. 
An activation domain can be positioned at any convenient site within the invention 
chimeric protein, e. g., at the carboxy terminus, the amino terminus, or between the 
ligand binding domain and the DNA binding domain v^ithin one or both protein units of 
the chimeric protein. In presently preferred embodiments of the invention, the activation 
domain is positioned at the amino terminus of the invention chimeric protein. 

Suitable activation domains can be obtained fi-om a variety of sources, e.g., firom 
the N-terminal region of members of the steroid/thyroid hormone nuclear receptor 
superfamily, ftom transcription factor activation domains, such as, for example, VP 16, 
GAL4, NF-kB or BP64 activation domains, and the like. The activation domain 
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presently preferred for use in the practice of the present invention is obtained from the C- 
terminal region of the VP 16 protein, and is known as VP16x. 

In a presently preferred embodiment of the present invention, chimeric 
proteins contain one or more ecdysone receptors (EcR) as the steroid/thyroid hormone 
5 nuclear receptor, for example, a Drosophila EcR (DEcR) or a Bomhyx EcR (BEcR). 
The chimeric protein further comprises either RXR or ultraspiracle protein (Usp) as 
an additional functional protein unit. The preferred order wdthin the chmieric protein 
is for the EcR to be located at the amino terminus of the chimeric protein. However, 
when the invention chimeric protein further comprises an activation domain, the 
10 activation domain is preferably located at the amino terminus of the chimeric protein. 

The EcR, an insect receptor, differs in two respects from other known 
steroid/thyroid hormone nuclear receptor superfamily. First, EcR has very different 
documented relationships with two similar dimer partners: its natural partner, Usp, 
and the mammalian homolog, RXR. EcR also differs from other members of the 
15 steroid/thyroid hormone nuclear receptor superfamily in that its apparent affinity to 
these heterodimer partners varies depending on the presence of ligand. 

To facilitate dimerization of the dimerization domains in the invention 
chimeric proteins, the at least two protein units of the chimeric protein preferably 
have the ligand binding domain, hinge domain, and DNA binding domain in the same 
20 order within each protein unit. If the chimeric protein additionally contains an 

activation domain, the activation domain is preferably located at the amino terminus 
of the chimeric protein, ahead of the first imit thereof, as illustrated in Examples 1 and 
4 herein. 

Invention chimeric protein(s) optionally further contain a linker interposed 
25 between one or more of the protein units. The protein units can be independently 
oriented amino terminus to carboxy terminus within the chimeric protein, or visa 
versa . For example, the linker can be placed between the carboxy terminus of the 
first protein unit and the amino terminus of the second protein unit. Any type of 
linker known in the art can be used for linking the protein units in invention chimeric 
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proteins so long as the linker is flexible and does not interfere with dimerization 
between protein units in the invention chimeric proteins. 

In one embodiment according to the present invention, the linker is a 
heterobifunctional cleavable cross-linker, such as N-succinimidyl (4-iodoacetyl)- 

5 aminobenzoate; sulfosuccinimidyl(4-iodoacetyl)-aminobenzoate; 4-succinimidyl- 
oxycarbonyl-a-(2-pyridyldithio) toluene ; sulfosuccininiidyl-6-[a-methyl-a- 
(pyridyldithiol)-toluaniido] hexanoate; N-succinimidyl-3 -(-2-pyridyldithio)- 
proprionate; succinimidyl-6-[3(-(-2-pyridyldithio)-proprionamido] hexanoate; 
sulfosuccinimidyl-6-[3(-(-2-pyridyldithio)-propionamido] hexanoate; 3-(2- 

10 pyridyldithio)-propionyl hydrazide, EUman's reagent, dichlorotriazinic acid, S-(2- 
thiopyridyl)-L-cysteme, and the like. Further bifunctional linkmg compounds are 
disclosed in U.S. Patent Nos. 5,349,066. 5,618,528, 4,569,789, 4,952,394, and 
5,137,877, each of which is incorporated herein by reference in its entirety. These 
chemical linkers can be attached to purified proteins using numerous protocols known 

15 in the art, such as those described in Pierce Chemicals "Solutions, Cross-linking of 
Proteins: Basic Concepts and Strategies," Seminar #12, Rockford, IL. 

In another embodiment according to the present mvention, the linker can be a 
peptide having from about 2 to about 60 amino acid residues, for example from about 
5 to about 40, or from about 10 to about 30 amino acid residues, such as is known in 

20 single-chain antibody research. Examples of such known linker moieties include 
GGGGS (SEQ ID N0:2), (GGGGS)„ (SEQ. ID NO:3), GKSSGSGSESKS (SEQ ID 
NO:4), GSTSGSGKSSEGKG (SEQ. ID NO:5), GSTSGSGKSSEGSGSTKG (SEQ 
ID N0:6), GSTSGSGKSSEGKG (SEQ ID NO:7), GSTSGSGKPGSGEGSTKG 
(SEQ ID NO:8), EGKSSGSGSESKEF (SEQ ID NO:9), SRSSG (SEQ. ID NO: 10), 

25 SGSSC (SEQ ID NO: 1 1 ), and the like. A Diphtheria toxin trypsin sensitive Imker 
having the sequence AMGRSGGGCAGNRVGSSLSCGGLNLQAM (SEQ ID 
NO: 12) is also useful. Alternatively, the peptide linker moiety can be VM or AM, or 
have the structure described by the formula: AM(G2 to 4S)xAM wherein X is an integer 
from 1 to 11 (SEQ ID NO: 13). Additional linking moieties are described, for 

30 example, in Huston et al. , PNAS 85:5879-5883, 1988; Whitlow, M., et al. , Protein 
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Engineering 6:989-995, 1993; Newton et al. Biochemistry 35:545-553, 1996; A. J. 
Cumber era/., Bioconj. Chem. 3:397-401, 1992; Ladumer e/ a/., J. Mol. Biol. 
273:330-337, 1997; and U.S. Patent. No. 4,894,443, the latter of which is incorporated 
herein by reference in its entirety. 

5 Generally, however, the linker contains from about 5 to about 245 amino 

acids, although there is no theoretical upper limit on the number of amino acids that 
could be used in the linker. Preferably, the linker contains from about 53 to about 125 
amino acids. The amino acids in the linker protein are preferably selected to provide 
flexibility to the linker. Preferably, a multiplicity of flexibility enhancing amino 

10 acids, such as proline, glycine, alanine and serine, are incorporated into the Unker to 
enhance its flexibility. 

Assuming a span of approximately 3.35 angstroms per amino acid within the 
flexible peptide bridge encoded by a 36-base pair Sfil compatible oligonucleotide, the 
predicted minimum and maximum distance for the lengths of the linker having from 0 

15 to 20 linker segments ranges from about 16.75 angstroms (the 5 amino acid bridge) to 
804 angstroms (20-linker segments + the 5 amino acid bridge). Thus, the length of 
the Hnker can readily be selected to enhance dimerization between any two particular 
members acting as dimer partners by including as many linker segments as is 
preferred to enhance the biological fimctions of the functional dimer, as discussed 

20 herein. 

In a presently preferred embodiment, the nucleotide encoding the polypeptide 
linker contains a restriction endonuclease recognition site which produces an 
overhang composed of non-palindromic center bases to allow for insertion of 
compatible inserts in a uniform orientation and in continually in-frame blocks along 
25 the length of the polypeptide linker. This type of linker allows incremental expansion 
of the Unker peptide to produce chimeric proteins containing linkers with a range of 
distances between the protein units. The nucleotide encoding the linker preferably 
contains a rare 8-base-pair Sfil recognition site that is usefiil in making constructs 
with linkers of variable length. In addition, the nucleotides composing the recognition 



22 



-GGCCNNNNNGGCC- (SEQ ID NO: 14) 

are guanidines and cytosines, which can be oriented in frame to encode glycine and 
proline residues in accordance with the criteria of producing a "flexible" protein 
linker for junction of the two units of the chimeric protein. Any bases can be used as 
5 the "N" nucleotides contained within the recognition site, allowing further flexibility 
in the design of the linker. 

A presently preferred linker amino acid sequence is GPGGGSGGGSGT (SEQ 
ID NO: 15), which provides a high degree of predicted flexibility while minimizing 
repetitive sequence within the encoding oligonucleotide. 

10 In accordance with another embodiment of the present invention, there are 

provided nucleotides encoding invention chimeric protein(s) and cells containing such 
nucleotides. Cells containing invention polynucleotides can be either mammalian or 
non-mammalian, for example, plant or fungi cells, and the like. 

In accordance with another embodiment of the present invention, there are 
15 provided methods for modulating the expression of exogenous gene(s) in a subject 
organism containing: 

1) a functional entity according to the invention and 

2) a DNA construct encoding and expressing the exogenous 
gene(s) under the control of a response element responsive to the functional 

20 dimer. 

Invention methods for modulating exogenous gene expression in such a subject 
organism comprise administering to the subject organism an effective amount of at 
least one ligand for the functional dimer. 

Ligand is selected to activate at least one functional unit of the functional entity. 
25 When the functional entity is a functional duner, the ligand is usually selected to activate 
the dominant member of the functional dimer. For example, if one of the two members 
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in the functional dimer is a silent dimer partner, the ligand is selected to activate the 
member in the functional dimer that does not act as a silent dimer partner. 

In a presently preferred embodiment of the present invention, one of the 
members in the functional entity is from an insect species, and the preferred ligand is an 
5 insect hormone. For example, preferred insect receptors are the Drosophila ecdysone 
receptor or the Bombyx ecdysone receptor, and the preferred dimer partner with these 
insect receptors in the invention chimeric protein is either the ultraspiracle protein or a 
retinoid X receptor. These functional entities complex vnth the ecdysone response 
element, generally in the presence of ligand for the functional dimer formed by the 
10 invention chimeric protein, but in some instances the presence of Ugand is not required 
to form a functional entity/response element complex, as explained more fully 
hereinbelow. 

As employed herein, the terms "modulate" and "modulating" refer to the ability 
of a given functional entity to activate/deactivate and/or up-regulate/down-regulate 
1 5 transcription of exogenous nucleic acids, relative to the transactivation activity in the 
absence of the functional entity. 

The actual effect of an invention functional entity on the transcription of 
exogenous or endogenous nucleic acids WiW vary depending on the particular 
combination of dimerization domains and/or members of the steroid/thyroid hormone 

20 nuclear receptor superfamily in the chimeric protein, on the presence or absence of 

specific hgand for the ligand binding domain(s) employed in the chimeric protein, and 
on the regulatory element (e.g., response element) v^dth which the selected chimeric 
protein interacts. It is specifically contemplated within the scope of the present invention 
that modulation includes repression of expression of one or more genes. Such repression 

25 can be either ligand-dependent repression or repression that occurs independently of the 
presence of a ligand. Thus, there are four types of modulation contemplated within the 
scope of the invention: ligand-dependent induced modulation, ligand-dependent 
repressed modulation, ligand-independent induced modulation, and ligand independent 
repressed modulation. The ligand can be either exogenous or endogenous to the subject 

30 treated for modulation of expression of an exogenous gene. 



24 

More particularly, the type of modulation that results from the practice of the 
invention method (i.e., whether activation or repression of expression of the exogenous 
gene) depends upon the combination of dimerization domains and/or members of the 
steroid/thyroid hormone nuclear receptor superfamily contained within a functional 
5 entity formed by the invention chimeric protein. For example, it has been determined 
that activation of expression of the exogenous gene(s) according to the invention 
modulation method can be achieved if the dimer partners in an invention FD used in the 
invention method of modulation are an ecdysone receptor and a retinoid X receptor, e.g. 
EOR or E5R. Ligands suitable for activating expression of the exogenous gene(s) when 
10 such functional dimers are employed in the invention methods include muristerone A, 
20-hydroxyecdysone, phytoecdysteroid(s), and the like. 

On the other hand, it has been determined that expression of an exogenous 
gene(s) can be repressed independently of (i.e., with or without) the presence of ligand if 
an invention FD comprising an ecdysone receptor (e.g., either a DrosophUa ecdysone 
receptor or a Bombyx ecdysone receptor) and an ultraspiracle protein as dimer partner is 
used in the invention method, e.g., E5U, and the like. Ligands suitable for activating 
expression of the exogenous gene(s) when such functional dimers are employed in the 
invention methods are 20-hydroxyecdysone, muristerone A, phytoecdysteroid(s), and the 
like. In addition, expression of exogenovis gene(s) can be repressed independently of 
(i.e., with or without) the presence of ligand if an invention FD comprising a Bombyx 
ecdysone receptor and an RXR as dimer partner is used in the invention method. As 
shown in Example 6 herein, an N-terminal His tag on the chimeric protein to aid in 
protein purification does not effect binding of the chimeric protein to a suitable response 
element so as to repress expression of the exogenous gene according to invention 
methods for modiilating expression of exogenous gene(s). 

Accordingly, in another embodiment of the present invention, there are provided 
methods for modulating (i.e., either activatmg or repressing) the expression of one or 
more genes in a subject organism independently of the presence of ligand for the 
invention chimeric protein. If the subject organism contains an invention chimeric 
30 protein, the invention method comprises introducing to the subject an exogenous 



25 

response element(s) with which the chimeric protein interacts and which controls 
expression of the one or more genes, thereby modulating expression of the gene(s) 
independent of the presence of ligand for the chimeric protein. On the other hand, if 
the subject organism contains an exogenous response element(s) controlling 
5 expression of the one or more genes, the invention method comprises introducing to 
the subject an invention chimeric protein with which the response element interacts, 
thereby modulating expression of the gene(s) independent of the presence of ligand 
for the chimeric protein. 

In accordance with another embodiment of the present invention, there are 
provided methods for modiilating (i.e., either activating or repressing) the expression of 
one or more exogenous genes independent of Ugand for the chimeric protein. If the 
subject contains a chimeric protein according to the invention,, the invention method 
comprises introducing to the subject an effective amount of a response element, wherein 
the response element is responsive to the chimeric protein and wherein the modulation is 
mdependent of Ugand for the chimeric protein. The modulation can be ligand 
independent activation or ligand independent repression. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating (i.e., either activating or repressing) the expression of 
one or more exogenous genes in a cell containing: 

20 1) an invention chimeric protein and 

2) a DNA construct comprising the exogenous gene under the 
control of a response element with which the chimeric protein interacts, 
wherein said response element controls expression of the exogenous gene, 
said method comprising administering to the cell an effective amount of an 

25 exogenous ligand for at least one functional unit of the chimeric protein. 

In accordance with another embodiment of the present invention, there are 
provided methods for modxilating the expression of one or more genes in a subject 
organism containing an endogenous response element controUing expression of one or 
more genes. The invention method in this situation comprises introducing to the 



26 



subject an invention chimeric protein, wherein the chimeric protein interacts with the 
response element, thereby modulating expression of the gene(s) dependent on the 
presence of endogenous ligand therefor. The chimeric protein is encoded by an 
inducible DNA construct and the modulating comprises inducing expression of the 
5 gene(s). 

In another embodiment according to the present invention, there are provided 
methods for modulating the expression of one or more genes in a subject organism 
containing an endogenous response element controlling expression of one or more 
genes and an endogenous ligand. The invention method comprises introducing to the 

10 subject an invention chimeric protein that interacts with the endogenous ligand and 
wherein the chimeric protein interacts with the response element, thereby modulating 
expression of the gene(s) dependent on the presence of the endogenous ligand. If the 
invention chimeric protein is encoded by an inducible DNA construct, the modulating 
further comprises inducing expression of the chimeric protein. This embodiment of 

15 the invention is especially useful for controlling expression of an exogenous gene that is 
under the control of an endogenous response element wherein the ligand for the 
invention functional dimer is also endogenous. 

Response elements contemplated for use in the practice of the present invention 
(relating to modulation of the expression of exogenous genes in a subject) include native, 

20 as well as modified response elements. For example, since invention functional dimers 
can fimction as either homodimers or as heterodimers (with a silent partaer therefor), any 
response element that is responsive to an invention functional dimer, in the form of a 
homodimer or heterodimer, is contemplated for use in the invention methods described 
herein. As is readily recognized by those of skill in the art, invention fimctional dimers 

25 (whether in the form of a homodimer or a heterodimer) can bind to a response element 
having an inverted repeat motif (i.e., two or more half sites in mirror image orientation 
with respect to one another), to a response element having a direct repeat motif, and the 
like. 
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Response elements useful in conjunction with invention ftmctional entities are 
those well known in the art. As readily recognized by those of skill in the art, the 
response element employed will vary as a function of the protein units incorporated into 
the functional entity. Thus, for example, retinoic acid receptor response elements are 
5 composed of at least one direct repeat of two or more defined half sites separated by a 
spacer of five nucleotides. The spacer nucleotides can independently be selected from 
any one of A, C, G or T. Each half site of response elements contemplated for use in 
the practice of the invention comprises the sequence: 

-RGBNNM-, 
10 wherein 

R is selected from A or G; 
B is selected from G, C, or T; 

each N is independently selected from A, T, C, or G; and 
M is selected from A or C; 

15 with the proviso that at least 4 nucleotides of said -RGBNNM- sequence are identical 
with the nucleotides at corresponding positions of the sequence -AGGTCA-. Response 
elements employed in the practice of the present invention can optionally be preceded 
by Nx, wherein x falls in the range of 0 up to 5. 

For example, thyroid hormone receptor response elements can be composed of 
20 the same half site repeats, with a spacer of four nucleotides. Alternatively, palindromic 
constructs as have been described in the art are also functional as TR response elements. 

Exemplary GAL4 response elements are those containing the palindromic 17- 

mer: 

5'-CGGAGGACTGTCCTCCG-3' (SEQ ID NO: 16), 

25 such as, for example, 17MX, as described by Webster et al., in Cell 52:169-178 (1988), 
as well as derivatives thereof. Additional examples of suitable response elements 
mclude those described by Hollenberg and Evans in Cell 55:899-906 (1988); or 
Webster et al. m Cell 54:199-207 (1988). 



28 



Ecdysone response element sequences are preferred for use herein with 
functional dimers containing an ecdysone receptor function in a position- and 
orientation-independent fashion. The native ecdysone response element has been 
previously described, see, e.g., Yao et al.. Cell, 71:63-72, 1992. 

In the invention methods the operative response element is fimctionally linked 
to an operative exogenoixs gene(s) whose expression it is desirable to control. The word 
"operative" means that the respective DNA sequences (represented by the terms 
"response element" and "exogenous or endogenous gene") are operational, i.e., work 
for their intended purposes; the word "functionally" means that after the two segments 
are linked, upon appropriate activation by a functional dimer/ligand complex, the 
exogenous gene(s) will be expressed as the result of the fact that the "response element" 
was "turned on" or otherwise activated. 

Certain nucleic acid constructs contemplated for use in one aspect of the present 
invention include promoters and regulatory elements operatively associated with 
exogenous nucleic acids. In one embodiment of the present invention, the invention 
functional dimer, in the presence of a ligand therefor, binds the regulatory element and 
activates transcription of one or more exogenous nucleic acids. For example, an 
invention fimctional dimer containing the protein units RXR and EcR will transactivate 
an ecdysone response element-containing promoter in the presence of the hormone 
ecdysone, or the synthetic analog, muristerone A. 

Regulatory elements contemplated for use in the practice of the present invention 
include elements responsive to the invention receptor peptide. In a preferred 
embodiment of the present invention, such elements are exogenous regulatory elements 
not normally present in the cells of the host. One class of exogenous regulatory elements 
25 contemplated for use herein includes hormone response elements that modulate 

transcription of exogenous nucleic acid when bound to the DNA binding domain of an 
invention receptor peptide. 

Regulatory elements employed in the practice of the present invention are 
operably linked to a suitable promoter for transcription of exogenous nucleic acid(s) 
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product(s). As used herein, the term "promoter" refers to a specific nucleotide sequence 
recognized by RNA polymerase, the enzyme that initiates RNA synthesis. The promoter 
sequence is the site at which transcription can be specifically initiated under proper 
conditions. When exogenous nucleic acid(s), operatively linked to a suitable promoter, 
5 is(are) introduced into the cells of a suitable host, expression of the exogenous nucleic 
acid(s) is(are) controlled in many, but not all cases, by the presence of ligands, which are 
not normally present in the host cells. 

Promoters contemplated for control of expression of exogenous nucleic acids 
employed in the practice of the present invention include mducible (e.g., minimal CMV 
10 promoter, minimal TK promoter, modified MMLV LTR), constitutive (e.g., chicken (3- 
actin promoter, MMLV LTR (non-modified), DHFR), and/or tissue specific promoters. 

Inducible promoters contemplated for use in the practice of the present invention 
comprise transcription regulatory regions that function maximally to promote 
transcription of mRNA under inducing conditions. Examples of suitable inducible 

1 5 promoters include DNA sequences corresponding to: the E. coli lac operator responsive 
to IPTG (see Nakamura et al. , Cell, 1 8 : 11 09- 11 1 7, 1 979); the metallothionein promoter 
metal-regulatory-elements responsive to heavy-metal (e.g., zinc) induction (see Evans et 
al, U.S. Patent No. 4,870,009), the phage T71ac promoter responsive to IPTG (see 
Studier etal.,Meth. EnzymoL, 185: 60-89, 1990; and U.S. Patent No. 4,952,496), the 

20 heat-shock promoter; the TK minimal promoter; the CMV minimal promoter; a 
synthetic promoter; and the like. 

Exemplary constitutive promoters contemplated for use in the practice of the 
present invention include the CMV promoter, the S V40 promoter, the DHFR promoter, 
the mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine 

25 leukemia virus (MMLV) promoter, elongation factor la (EFla) promoter, albumin 
promoter, APO Al promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CDS promoter, immunoglobulin light or heavy chain promoters, 
neurofiliment promoter, neuron specific enolase promoter, L7 promoter, CD2 promoter, 
myosin light chain kinase promoter, HOX gene promoter, thymidine kinase (TK) 

30 promoter, RNA Pol II promoter, MYOD promoter, MYF5 promoter. 
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phosphoglycerokinase (PGK) promoter, Stfl promoter. Low Density Lipoprotein (LDL) 
promoter, chicken b-actin promoter (used in conjunction with ecdysone response 
element), and the like. 

As readily understood by those of skill in the art, the term "tissue specific" refers 
5 to the substantially exclusive initiation of transcription in the tissue from which a 
particular promoter that drives expression of a given gene is derived (e.g., expressed 
only in T-cells, endothelial cells, smooth muscle cells, and the like). Exemplary tissue 
specific promoters contemplated for use in the practice of the present invention include 
the GH promoter, the NSE promoter, the GFAP promoter, neurotransmitter promoters 
10 (e.g., tyrosine hydroxylase, TH, choline acetyltransferase, ChAT, and the like), 

promoters for neurotropic factors (e.g., a nerve growth factor promoter, NT-3, BDNF 
promoters, and the Uke), and so on. 

As used herein, when referring to nucleic acids, the phrase "exogenous to said 
mammalian host" or simply "exogenous" refers to nucleic acids not naturally found at 

1 5 levels sufficient to provide a function in the particular cell where transcription is desired. 
For example, exogenous nucleic acids can be either natural or synthetic nucleic acids, 
which are introduced into the host in the form of DNA or RNA. The nucleic acids of 
interest can be introduced into target cells (for in vitro applications), or the nucleic acids 
of interest can be introduced directly or indurectly into a host, for example, by the 

20 transfer of transformed cells into a host. 

In contrast to exogenous nucleic acids, the phrase "endogenous nucleic acids" or 
"endogenous genes" refers to nucleic acids naturally found at levels sufficient to provide 
a function in the particular cell where transcription is desired. 

Exogenous nucleic acids contemplated for use in the practice of the present 
25 invention include wild tj^e and/or therapeutic nucleic acids. "Wild type" genes are 

those that are native to cells of a particular type. Exemplary wild type nucleic acids are 
genes which encode products the substantial absence of which leads to the occurrence of 
a non-normal state in a host; or a substantial excess of which leads to the occurrence of a 
non-normal state in a host. 
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Such genes may not be expressed in biologically significant levels or may be 
imdesirably overexpressed. Thus, for example, while a synthetic or natural gene coding 
for human insulin would be exogenous genetic material to a yeast cell (since yeast cells 
do not naturally contain insulin genes), a human insulin gene inserted into a human skin 
5 fibroblast cell would be a wild type gene with respect to the fibroblast since human skin 
fibroblasts contain genetic material encoding human insulin, although human skin 
fibroblasts do not express human insulin in biologically significant levels. 

Therapeutic nucleic acids contemplated for use in the practice of the present 
invention include those which encode products which are toxic to the cells in which they 
1 0 are expressed; or encode products which impart a beneficial property to a host; or those 
which transcribe nucleic acids which modulate transcription and/or translation of 
endogenous genes. 

As employed herein, the phrase "therapeutic nucleic acids" refers to nucleic 
acids that impart a beneficial fimction to the host in which such nucleic acids are 

1 5 transcribed. Therapeutic nucleic acids are those that are not naturally found in host cells. 
For example, synthetic or natural nucleic acids coding for wild type human insulin 
would be therapeutic when inserted into a skin fibroblast cell so as to be expressed in a 
human host, where the human host is not otherwise capable of expressing fimctionally 
active human insulin in biologically significant levels. Further examples of therapeutic 

20 nucleic acids include nucleic acids that transcribe antisense constructs used to suppress 
the expression of endogenous genes. Such antisense transcripts bind endogenous nucleic 
acid (mRNA or DNA) and effectively cancel out the expression of the gene. In 
accordance with the methods described herein, therapeutic nucleic acids are expressed at 
a level that provides a therapeutically effective amount of the corresponding therapeutic 

25 protein. 

Exogenous nucleic acids usefiil in the practice of the present invention include 
genes that encode biologically active proteins of interest, such as, e.g., secretory proteins 
that can be released fi-om said cell; enzymes that can metabolize a toxic substance to 
produce a non-toxic substance, or that metabolize an inactive substance to produce a 
30 usefiil substance; regulatory proteins; cell surface receptors; and the like. Usefiol genes 
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include genes that encode blood clotting factors, such as human factors VIII and IX; 
genes that encode hormones, such as insulin, parathyroid hormone, luteinizing hormone 
releasing factor (LHRH), alpha and beta seminal inhibins, and human growth hormone; 
genes that encode proteins, such as enzymes, the absence of which leads to the 

5 occurrence of an abnormal state; genes encoding cytokines or lymphokines such as 
interferons, granulocytic macrophage colony stimulating factor (GM-CSF), colony 
stimulating factor-1 (CSF-1), tumor necrosis factor (TNF), and erythropoietin (EPO); 
genes encoding inhibitor substances such as alphai-antitrypsin; genes encoding 
substances that function as drugs, e.g., genes encoding the diphtheria and cholera toxins; 

10 and the like. 

Additional nucleic acids contemplated for use in accordance with the present 
invention include genes that encode proteins present in dopaminergic nevirons (useful, 
for example, for the treatment of Parkinson's disease), cholinergic neurons (usefiil, for 
example, for the treatment of Alzheimer's disease), hippocampal pyramidal neurons 

1 5 (also useful for the treatment of Alzheimer' s disease), norepinephrine neurons (usefiil, 
for example, for the treatment of epilepsy), spinal neurons (usefiil, for example, for the 
treatment of spinal injury), glutamatergic neurons (useful, for example, for the treatment 
of schizophrenia), cortical neurons (useful, for example, for the treatment of stroke and 
brain injury), motor and sensory neurons (useful, for example, for the treatment of 

20 amyotrophic lateral sclerosis), and the like. 

Typically, nucleic acid sequence information for proteins encoded by exogenous 
nucleic acid(s) contemplated for use employed herein can be located in one of many 
pubUc access databases, e.g., GENBANK, EMBL, Swiss-Prot, and PIR, or in related 
journal publications. Thus, those of skill in the art have access to sequence information 

25 for virtually all known genes. Those of skill in the art can obtain the corresponding 
nucleic acid molecule directly from a public depository or from the institution that 
published the sequence. Optionally, once the nucleic acid sequence encoding a desired 
protein has been ascertained, the skilled artisan can employ routine methods, e.g., 
polymerase chain reaction (PGR) amplification, to isolate the desired nucleic acid 

30 molecule from the appropriate nucleic acid library. Thus, all known nucleic acids 
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encoding proteins of interest are available for use in the methods and products described 
herein. 

Additional components that can optionally be incorporated into the invention 
constructs include selectable markers and genes encoding proteins required for retroviral 
5 packaging, e.g., the pol gene, the gag gene, the em gene, and the like. 

Selectable markers contemplated for use in the practice of the present invention 
include antibiotic resistance genes, genes that enable cells to process metabolic 
intermediaries, and the like. Exemplary antibiotic resistance genes include genes which 
impart tetracycline resistance, genes that impart ampicillin resistance, neomycin 
10 resistance, hygromycin resistance, puromycin resistance, and the like. 

Genes that enable cells to process metabolic intermediaries include genes which 
permit cells to incorporate L-histidinol, genes encoding thymidine kinase, genes 
encoding xanthine-guanine phosphoribosyl transferase (gpt), genes encoding 
dihydrofolate reductase, genes encoding asparagine synthetase, and the like. 

15 As employed herein, the terms "subject organism" and "host" refer to the cell, 

tissue, organ or organism in need of transcriptional regulation of exogenous or 
endogenous nucleic acids. The subject organism can be mammalian or mammalian- 
derived cells or tissue. Exemplary mammals include: humans; domesticated animals, 
e.g., rat, mouse, rabbit, canine, felme, and the like; farm animals, e.g., chicken, bovine, 

20 ovine, porcine, and the like; animals of zoological interest, e.g., monkey, baboon, and the 
like, or a cell thereof. Alternatively, a subject organism can be a non-mammalian, 
preferably non-insect, such as a plant, ftingus or other non-mammalian species, or a 
cell of such a non-mammalian species. 

As employed herein, the term "ligand" (or ligand precursor) refers to a steroidal 
25 or non-steroidal substance or compound which, in its native state (or after conversion to 
its "active" form), binds to at least one of the protein tinits, or to the invention chimeric 
protein, thereby creating a Ugand/functional entity complex, which in turn can bind an 
appropriate response element and activate transcription therefrom. Ligands function to 
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modulate transcription of nucleic acid(s) maintained under the control of a response 
element. Such ligands are well known in the art. 

In accordance with one aspect of the present invention, unless and until a suitable 
ligand is administered to the host, substantially no transcription of the desired exogenous 
5 nucleic acids occurs. Since ecdysteroids, for example, are not naturally present in 

mammalian, plant and fungal systems, and the like, if it is desired that transcription of a 
particular exogenous nucleic acid be xmder precise control of the practitioner, a chimeric 
protein containing an ecdysone receptor as one of the protein units and a suitable dimer 
partner therefore is used and the exogenous nucleic acid is put under the control of an 
10 ecdysone response element, i.e. a response element to which an activated ecdysone 
receptor binds in nature. 

The terms "ecdysone" and "ecdysteroid" as interchangeably used herein, are 
employed in the generic sense (in accordance with common usage in the art), referring to 
a family of hgands with the appropriate binding and transactivation activity (see, for 
1 5 example, Cherbas et al., in Biosynthesis, metabolism and mode of action of invertebrate 
hormones (Ed. J. Hoffinann and M. Porchet), Springer- Verlag, Berlin, p 305-322. An 
ecdysone, therefore, is a compound which acts to modulate gene transcription for a gene 
maintained imder the control of an ecdysone response element. 

20-Hydroxy-ecdysone (also known as p-ecdysone) is the major naturally 
20 occurring ecdysone. Unsubstituted ecdysone (also knovm as a-ecdysone) is converted 
in peripheral tissues to P-ecdysone. Analogs of the naturally occurring ecdysones are 
also contemplated within the scope of the present invention. Examples of such analogs, 
commonly referred to as ecdysteroids, include ponasterone A, 26 iodoponasterone A, 
muristerone A, inokosterone, 26-mesylinokosterone, and the like. Since it has been 
25 previously reported that the above-described ecdysones are neither toxic, teratogenic, nor 
known to affect mammaUan physiology, they are ideal candidates for use as inducers in 
cultured cells and transgenic mammals according to the invention methods. 

Other phytoecdysteroids are also contemplated for use in the practice of the 
invention as ligands of receptors which recognizes ecdysone response elements. Such 
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phytoecdysteroids are known in the art (J.H. Adler et al, Lipids 30(3):257-62, 1995). 
The biological effect of phytoecdysteroids in higher animals are also known (V.N. 
Syrov, Eksp. Klin. Farmakol 57(5):61-6, 1994). 

Non-steroidal ligands are also contemplated for use in the practice of the present 
5 invention as ligands of ecdysone response elements. For example, when a ligand not 
normally present in the cells of the host to be treated is desired (i.e., a ligand exogenous 
to the host), a hydrazine can be employed as the ligand, preferably a diacyl hydrazine. 
Such hydrazines include compounds that are readily available and/or are relatively 
inexpensive to manufacture. One such compound, tebufenozide, is a non-steroidal 
10 ecdysone agonist which is commercially available. This compound specifically targets 
lepidopteran species, including Bombyx mori. Tebufenozide has undergone extensive 
testing in animal hosts and has proved to be of very low toxicity to mammals and other 
non-insect species. 

Additional exemplary hydrazines contemplated for use herein include 1,2-diacyl 
1 5 hydrazines (e.g., tebufenozide), N' -substituted-N,N' -disubstituted hydrazines, 

dibenzoylalkyl cyanohydrazines, N-substituted-N-alkyl-N,N-diaroyl hydrazines, N- 
substituted-N-acyl-N-aDcyls, carbonyl hydrazines, N-aroyl-N'-alkyl-N'-aroyl hydrazines, 
and the like. Since it has been previously reported that the above-described diacyl 
hydrazines are neither toxic, teratogenic, nor known to affect mammalian physiology, 
20 they are ideal candidates for use as exogenic hgands (e.g. as inducers) in cultured cells 
and transgenic mammals according to invention methods. 

Ligands, and formulations containing them, administered in a manner compatible 
with the route of administration, the dosage formulation, and in a therapeutically 
effective amomt. The reqioired dosage will vary with the particular treatment desired, 
25 the degree and duration of therapeutic effect desired, the judgment of the practitioner, as 
well as properties peculia* to each individual. Moreover, suitable dosage ranges for 
systemic application depend on the route of administration. It is anticipated that dosages 
between about 10 micrograms and about 1 milligram per kilogram of body weight per 
day will be used for therapeutic treatment. 
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An effective amount of ligand contemplated for use in the practice of the present 
invention is the amount of ligand (e.g., diacyl hydrazine(s)) required to achieve the 
desired level of transcription and/or translation of exogenous nucleic acid. A 
therapeutically effective amount is typically an amount of ligand or ligand precursor that, 
5 when administered in a physiologically acceptable composition, is sufficient to achieve a 
plasma concentration of the transcribed or expressed nucleic acid product from about 

0. 1 mg/ml to about 100 mg/ml, for example, from about 1.0 mg/ml to about 50 mg/ml, 
and preferably at least about 2 mg/ml and usually 5 to 10 mg/ml. 

Ligand can be administered in a variety of ways, as are well-known in tiie art, 

1. e., by any means that produces contact between ligand and receptor peptide. For 
example, such ligands can be administered topically, orally, intravenously, 
intraperitoneally, intravascularly, and the like. The administration can be by any 
conventional means available for use in conjunction with pharmaceuticals, e.g., by 
intravenous injection, either as individual therapeutically active ingredients or in a 
combination with other therapeutically active ingredients. Ligands contemplated for use 
in the practice of the present invention can be administered alone, but are generally 
administered with a pharmaceutical carrier selected on the basis of the chosen route of 
administration and standard pharmaceutical practice. 

fri accordance with a particular embodiment of the present invention, 
20 pharmaceutically acceptable formulations, and kits thereof, comprising at least one 
ligand for an invention fimctional dimer, for example an ecdysteroid, and a 
pharmaceutically acceptable carrier are contemplated. In accordance with another aspect 
of the present invention, pharmaceutically acceptable formulations consisting essentially 
of at least one ligand and a pharmaceutically acceptable carrier, are contemplated. 
25 Pharmaceutical formulations of the present invention can be used in the form of a solid, 
a solution, an emulsion, a dispersion, a micelle, a liposome, and the like, wherein the 
resulting formulation contains one or more of the ligands of the present invention, as an 
active ingredient, in admixture with an organic or inorganic carrier or excipient suitable 
for enteral or parenteral apphcations. 
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The ligand(s) may be compounded, for example, with the usual non-toxic, 
pharmaceutically acceptable carriers suitable for administration by oral, topical, nasal, 
transdermal, intravenous, subcutaneous, intramtiscular, intracutaneous, intraperitoneal, 
intravascular, and the like means. Administration in the form of creams, lotions, tablets, 
5 dispersible powders, granules, syrups, elixirs, sterile aqueous or non-aqueous solutions, 
suspensions or emulsions, and the like, is contemplated. Exemplary pharmaceutically 
acceptable carriers include carriers for tablets, pellets, capsules, suppositories, solutions, 
emulsions, suspensions, and any olher form suitable for use. Such carriers which can be 
used include glucose, lactose, gum acacia, gelatin, mannitol, starch paste, magnesium 

10 trisilicate, talc, com starch, keratin, colloidal siUca, potato starch, urea, medium chain 
length triglycerides, dextrans, and other carriers suitable for use in manufacturing 
preparations, in solid, semisolid, or liquid form. In addition auxiliary, stabilizing, 
thickening and coloring agents and/or perfumes may be used. The active compound 
(e.g., ecdysteroid as described herein) is included in the pharmaceutically acceptable 

15 formulation in an amount sufficient to produce the desired effect upon the process or 
condition of diseases. 

Pharmaceutically acceptable formulations containing hgand(s) as active 
ingredient may be in a form suitable for oral use, for example, as aqueous or oily 
suspensions, syrups or elixirs, tablets, troches, lozenges, dispersible powders or granules, 

20 emulsions, or hard or soft capsules. For the preparation of oral liquids, suitable carriers 
include emulsions, solutions, suspensions, syrups, and the like, optionally containing 
additives such as wetting agents, emulsifying and suspending agents, dispersing agents, 
sweetening, flavoring, coloring, preserving and perfuming agents, and the like. 
Formulations intended for oral use may be prepared according to any method known to 

25 the art for the manufacture of pharmaceutically acceptable formulations. 

Tablets containing ligand(s) as active ingredient in admixture with non-toxic 
pharmaceutically acceptable excipients may also be manufactured by known methods. 
The excipients used may be, for example, (1) inert diluents such as calcium carbonate, 
lactose, calcium phosphate or sodium phosphate; (2) granulating and disintegrating 
30 agents such as com starch, potato starch or alginic acid; (3) binding agents such as gum 
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tragacanth, com starch, gelatin or acacia, and (4) lubricating agente such as magnesium 
stearate, stearic acid or talc. The tablets may be uncoated or they may be coated by 
known techniques to delay disintegration and absorption in the gastrointestinal tract and 
thereby provide a sustained action over a longer period. For example, a time delay 
5 material such as glyceryl monostearate or glyceryl distearate may be employed. They 
may also be coated by the techniques described in the U.S. Pat. Nos. 4,256,108; 
4,160,452; and 4,265,874, to form osmotic therapeutic tablets for controlled release. 

In some cases, formulations for oral use may be in the form of hard gelatin 
capsules wherein the ligand is mixed with an inert solid diluent, for example, calcium 
10 carbonate, calcium phosphate or kaolin. They may also be in the form of soft gelatin 
capsules wherein the ligand is mixed with water or an oil medium, for example, peanut 
oil, liquid paraffin, or olive oil. 

The pharmaceutically acceptable formulations may be in the form of a sterile 
injectable suspension. Suitable carriers include non-toxic parenterally-acceptable sterile 

15 aqueous or non-aqueous solutions, suspensions, or emulsions. This suspension may be 
formulated according to known methods using suitable dispersing or wetting agents and 
suspending agents. They can also be manufactured in the form of sterile water, or some 
other sterile injectable medium immediately before use. Sterile, fixed oils are 
conventionally employed as a solvent or suspending mediimi. For this purpose any 

20 bland fixed oil may be employed including synthetic mono- or diglycerides, fatty acids 
(including oleic acid), naturally occurring vegetable oils like sesame oil, coconut oil, 
peanut oil, cottonseed oil, etc., or synthetic fatty vehicles like ethyl oleate or the like. 
They may be sterilized, for example, by filtration through a bacteria-retaining filter, by 
incorporating sterilizing agents into the formulations, by irradiating the formulations, or 

25 by heating the formulations. Sterile injectable suspensions may also contain adjuvants 
such as preserving, wetting, emulsifying, and dispersing agents. Buffers, preservatives, 
antioxidants, and the like can be incorporated as reqviired. 

Compounds contemplated for use in the practice of the present invention may 
also be administered in the form of suppositories for rectal administration of the drug. 
30 These formulations may be prepared by mixing the drug with a suitable non-irritating 
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excipient, such as cocoa butter, synthetic glyceride esters of polyethylene glycols, which 
are solid at ordinary temperatures, but liquefy and/or dissolve in the rectal cavity to 
release the drug. 

PharmaceuticaUy acceptable formulations containing suitable ligand(s) are 
5 preferably administered intravenously, as by injection of a unit dose, for example. The 
term "unit dose," when used in reference to a pharmaceutically acceptable formulation 
of the present invention, refers to a quantity of the pharmaceutical formulation suitable 
as unitary dosage for the subject, each imit containing a predetermined quantity of active 
material calculated to produce the desired therapeutic effect in association with the 
10 required diluent, i.e., carrier, or vehicle. It may be particularly advantageous to 

administer such formulations in depot or long-lasting form as discussed hereinafter. 

Therapeutic compositions or pharmaceutically acceptable formulations 
containing suitable ligand are preferably administered intravenously, as by injection of a 
unit dose, for example. The term "unit dose," when used in reference to a therapeutic 
15 composition of the present invention, refers to a quantity of ligand suitable as unitary 
dosage for the subject, each umi containing a predetermined quantity of active material 
calculated to produce the desired therapeutic effect in association with the required 
diluent, i.e., carrier, or vehicle. It may be particularly advantageous to administer such 
compounds in depot or long-lasting form. 

20 Suitable regimes for initial administration and booster shots are variable, but are 

typified by an initial administration followed by repeated doses at one or more intervals, 
by a subsequent injection, or other administration. Altematively, continuous intravenous 
infusion sufficient to maintain concentrations in the blood in the ranges specified for 
in vivo therapies are contemplated. 

25 In accordance witii another embodiment of the present invention, there are 

provided methods for producing transgenic animals capable of prolonged and regulated 
expression of exogenous nucleic acid(s), said method comprising introducing into early- 
stage embryos or stem cells of the animal: 
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(i) a nucleic acid construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a regulatory element; and 

(ii) nucleic acid encoding an invention chimeric protein wherein the chimeric 
protein activates the regulatory element in the presence of a ligand for the 

5 functional dimer or represses the regulatory element independently of the 

presence of said ligand. 

As used herein, the phrase "transgenic animal" refers to an animal that contains 
one or more expression constructs containing one or more exogenous nucleic acid(s) 
under the transcription control of an operator and/or hormone response element as 
10 described herein. 

Methods of making transgenic animals using a particular nucleic acid construct 
are well-known in the art. When preparing invention transgenic animals, it is presently 
preferred that two transgenic lines are generated. The first line will express, for 
example, a chimeric protein as described above (e.g., VBEcR). Tissue specificity is 

15 conferred by the selection of a tissue-specific promoter (e.g., T-cell specific) that will 
direct expression of the chimeric protein to appropriate tissue. A second line contains a 
nucleic acid construct comprising a promoter and exogenous nucleic acid under the 
control of a response element, for example, an endogenous response element. Cross- 
breeding of Ihese two lines will provide a transgenic animal that expresses an invention 

20 chimeric protein and the exogenous nucleic acid. 

In a presently preferred embodiment, an invention transgenic animal contains 
one or more expression constructs containing nucleic acid encoding an invention 
chimeric protein and exogenous nucleic acid under the transcription control of a 
response element. Thus, with tissue specific expression of the chimeric protein as 
25 described above and timely hgand treatment, gene expression can be mduced or 
repressed with spatial, dosage, and/or temporal specificity. 

In accordance with yet another embodiment of the present invention, there are 
provided methods for modulating the transcription of an exogenous nucleic acid in a host 
containing: 
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(i) a nucleic acid construct comprising a promoter and said exogenotis 
nucleic acid(s) under ^e control of a response element; and 

(ii) nucleic acid under the control of an inducible promoter, said nucleic acid 
encoding an invention chimeric protein whereia the functional entity formed by 
the invention chimeric protein activates or represses the response element in the 
presence of a ligand for the entity; and 

said method comprising introducing a ligand not normally present in the 
cells of the host and subjecting the host to conditions suitable to induce or repress 
expression of the invention chimeric protein. 

In accordance with yet another embodiment of the present invention, there are 
provided methods for the expression of recombinant products detrimental to a subject 
organism, said method comprising: 

transforming suitable cells in the organism with: 

(i) a nucleic acid construct comprising a promoter and exogeno\is nucleic 
acid(s) which express the recombinant product under the control of a regulatory 
element that is not normally present in the cells of said organism, and 

(ii) nucleic acid encoding an invention chimeric protein 

wherein the functional entity formed by the invention chimeric protein 
activates the regulatory element in the presence of a ligand for the ftinctional 
entity; 

growing said cells to the desired level in the substantial absence of the 
ligand; and 

inducing expression of said recombinant product by administering to the 
organism a ligand, which, in combination with said entity, binds to said 
regulatory element and activates transcription therefrom. 

Recombmant products detrimental to a host organism contemplated for 
expression in accordance with the present invention include any gene product that 
functions to confer a toxic effect on the organism. For example, inducible expression of 
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a toxin such as the diphtheria toxin would allow for specific ablation of tissue (Ross et 
al. Genes and Development 7:1318-1 324 ( 1 993)), for example to create a new phenotype 
in the transgenic animal. Moreover, the numerous gene products that are known to 
induce apoptosis in cells expressing such products are contemplated for use herein (see, 
5 e.g, Apoptosis, The Molecular Basis of Cell Death, Current Communications In Cell & 
Molecular Biology, Cold Spring Harbor Laboratory Press, 1991). 

In accordance with still another embodiment of the present invention, there are 
provided methods for modulating the transcription of nucleic acid(s) in an in vitro 
cellular system, wherein the method comprises administering to the cellular system an 
1 0 amount of Ugand effective to modulate the transcription of the nucleic acid(s); wherein 
the hgand is not normally present in the cellular system and wherein the system 
comprises: 

(i) a nucleic acid construct comprising a promoter and the nucleic acid(s) 
under the control of a response element; and 

15 (ii) nucleic acid encoding an invention chimeric protein, 

wherein the ftinctional entity formed by the invention chimeric protein 
activates or represses the regulatory element in the presence of a ligand for the 
ligand binding domain. 

In accordance with yet another embodiment of the present invention, there are 
20 provided methods for the treatment of a host in need of gene therapy, said method 
comprising: 

introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter and the exogenous 
nucleic acid(s) under the control of a response element; 



25 (ii) nucleic acid under the control of an inducible promoter, said nucleic acid 

encoding an invention chimeric protein. 
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wherein the functional dimer fomed by the invention chimeric protein 
activates or represses the regulatory element in the presence of a ligand for the 
functional dimer, and 

administering, to said host, an effective amount of ligand for the 
5 invention functional dimer. 

Optionally, the cells can be obtained from the host, modified as above, and then 
reintroduced into the host organism. For example, the exogenous nucleic acid can be 
introduced directly into cells obtained from a donor (host or separate donor) and the 
modified cells are then implanted within the host organism. In a presently preferred 
1 0 embodiment, the transplanted cells are autologous with respect to the host. 

"Autologous" means that the donor and recipient of the cells are one and the same. 

Cells can be modified by "m vivo delivery" of biological materials by such routes 
of administration as oral, intravenous, subcutaneous, intraperitoneal, intrathecal, 
intramuscular, intracranial, inhalational, topical, transdermal, suppository (rectal), 
1 5 pessary (vaginal), and the like. The exogenous nucleic acid may be stably incorporated 
into cells or may be transiently expressed using methods known in the art. 

Modified cells are cultivated under growth conditions (as opposed to protein 
expression conditions) until a desired density is achieved. Stably transfected mammalian 
cells may be prepared by transfecting cells with an expression vector having a selectable 

20 marker gene (such as, for example, the gene for thymidine kinase, dlhydrofolate 

reductase, neomycin resistance, and the like), and grovraig the transfected cells under 
conditions selective for cells expressing the marker gene. To prepare transient 
transfectants, mammalian cells are transfected with a reporter gene (such as the E. coli 6- 
galactosidase gene) to monitor transfection efficiency. Selectable marker genes are 

25 typically not included in the transient transfections because the transfectants are typically 
not grown under selective conditions, and are usually analyzed within a few days after 
transfection. 

The concept of gene replacement therapy for humans involves the introduction 
of functionally active "wild type" or "therapeutic" nucleic acids into the somatic cells of 
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an affected host to correct a gene defect or deficiency. However, in order for gene 
replacement therapy to be effective, it must be possible to control the time and location 
at which gene expression occurs. 

Genes that encode useful "gene therapy" proteins that are not normally 
5 transported outside the cell can be used in the invention if such genes are "functionally 
appended" to, or operatively associated with, a signal sequence that can "transport" the 
encoded product across the cell membrane. A variety of such signal sequences are 
known and can be used by those skilled in the art without undue experimentation. 

Gene transfer vectors (also referred to as "expression vectors") contemplated for 
use herein are recombinant nucleic acid molecules that are used to transport nucleic acid 
into host cells for expression and/or replication thereof Expression vectors may be 
either circular or linear, and are capable of incorporating a variety of nucleic acid 
constructs therein. Expression vectors typically come in the form of a plasmid that, upon 
introduction into an appropriate host cell, results in expression of the inserted nucleic 
acid. 

Suitable expression vectors for use herein are well known to those of skill in the 
art and include recombinant DNA or RNA construct(s), such as plasmids, phage, 
recombinant virus or other vectors that, upon introduction into an appropriate host cell, 
result(s) in expression of the inserted DNA. Appropriate expression vectors are well 
20 known to those of skill in the art and include those that are replicable in eukaryotic cells 
and/or prokaryotic cells and those that remain episomal or those which integrate into the 
host cell genome. Expression vectors typically further contain other fimctionally 
important nucleic acid sequences encoding antibiotic resistance proteins, and the like. 

The amount of exogenous nucleic acid introduced into a host organism, cell or 
25 cellular system can be varied by those of skill in the art. For example, when a viral 

vector is employed to achieve gene transfer, the amount of nucleic acid introduced can 
be varied by varying the amount of plaque forming units (PFU) of the viral vector. 

As used herein, the phrase "transcription regulatory region" refers to that portion 
of a nucleic acid or gene construct that controls the initiation of mRNA transcription. 
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Regulatory regions contemplated for use herein, in the absence of the non-mammalian 
transactivator, typically comprise at least a minimal promoter in combination with a 
regulatory element responsive to the ligand/receptor peptide complex. A minimal 
promoter, when combined witii a regulatory element, functions to initiate mRNA 
5 transcription in response to a ligand/functional dimer complex. However, transcription 
will not occur unless the required inducer (ligand therefor) is present. However, as 
described herein certain of the invention chimeric protein heterodimers activate or 
repress mRNA transcription even in the absence of ligand for the DNA binding domain. 

As used herein, the phrase "operatively associated with" refers to the functional 
10 relationship of DNA with regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational stop sites, and other signal 
sequences. For example, operative linkage of DNA to a promoter refers to the physical 
and functional relationship between the DNA and promoter such that the transcription of 
such DNA is initiated from the promoter by an RNA polymerase that specifically 
15 recognizes, binds to and transcribes the DNA. 

Preferably, the transcription regulatory region further comprises a binding site 
for ubiquitous transcription factor(s). Such binding sites are preferably positioned 
between the promoter and the regulatory element. Suitable ubiquitous transcription 
factors for use herein are well-known in the art and include, for example, Spl . 

20 Exemplary eukaryotic expression vectors include eukaryotic constructs, such as 

the pSV-2 gpt system (Mulligan et al, (1979) Nature, 277:108-1 14); pBlueSkript 
(Stratagene, La JoUa, CA), the expression cloning vector described by Genetics Institute 
(Science, (1985) 228:810-815), and the like. Each of these plasmid vectors is capable of 
promoting expression of the chimeric protein of interest. 

25 Suitable means for introducing (transducing) expression vectors containing 

invention nucleic acid constructs into host cells to produce transduced recombinant cells 
(i.e., cells containing recombinant heterologous nucleic acid) are well-known in the art 
(see, for review, Friedmann, Science, 244:1275-1281, 1989; Mulligan, Science, 260:926- 
932. 1993, each of which are mcorporated herein by reference in their entirety). 
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Exemplary methods of transduction include, e.g., infection employing viral vectors (see, 
e.g., U.S. Patent 4,405,712 and 4,650,764), calcium phosphate transfection (U.S. Patents 
4,399,216 and 4,634,665), dextran sulfate transfection, electroporation, lipofection (see, 
e.g., U.S. Patents 4,394,448 and 4,619,794), cytofection, particle bead bombardment, 
5 and the like. The transduced nucleic acid can optionally include sequences which allow 
for its extrachromosomal (i.e., episomal) maintenance, or the transduced nucleic acid can 
be donor nucleic acid that integrates into the genome of the host. 

In a specific embodiment, a gene transfer vector contemplated for use herein is a 
viral vector, such as Adenovirus, adeno-associated virus, a herpes-simplex virus based 

10 vector, a synthetic vector for gene therapy, and the like (see, e.g., Suhr et al, Arch, of 
Neurol. 50: 1252-1268, 1993). Preferably, a gene transfer vector employed herein is a 
retroviral vector. Retroviral vectors contemplated for use herein are gene transfer 
plasmids that have an expression construct containing an exogenous nucleic acid 
residing between two retroviral LTRs. Retroviral vectors typically contain appropriate 

1 5 packaging signals that enable the retroviral vector, or RNA transcribed using the 
retroviral vector as a template, to be packaged into a viral virion in an appropriate 
packaging cell line (see, e.g., U.S. Patent 4,650,764). 

Suitable retroviral vectors for use herein are described, for example, in U.S. 
Patents 5,399,346 and 5,252,479; and in WIPO pubUcations WO 92/07573, WO 

20 90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, each of which is hereby 

incorporated herein by reference, in its entirety. These documents provide a description 
of methods for efficiently introducing nucleic acids into human cells using such 
retroviral vectors. Other retroviral vectors include, for example, mouse mammary tumor 
virus vectors (e.g., Shackleford et a/., (1988) PNAS, USA, 85:9655-9659), human 

25 immimodeficiency virus (e.g., Naldini et al. (1996) Science 272: 1 65-320), and the like. 

Various procedures are also well-known in the art for providing helper cells 
which produce retroviral vector particles that are essentially free of replicating virus. 
See, for example, U.S. Patent 4,650,764; Miller, Human Gene Therapy, 1:5-14, 1990; 
Markowitz, et al. Journal of Virology, 61(4): 11 20-1 124, 1988; Watanabe, etal, 
30 Molecular and Cellular Biology, 3a2):224 1-2249, 1983; Danos, et al, PNAS, 85:6460- 
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6464, 1988; and Bosselman, et al. Molecular and Cellular Biology, 7(5): 1797- 1806, 
1987, which disclose procedures for producing viral vectors and helper cells that 
minimize the chances for producing a viral vector that includes a replicating virus. 

Recombinant retroviruses suitable for carrying out the invention methods are 
5 produced employing v^ell-known methods for producing retroviral virions. See, for 
example, U.S. Patent 4,650,764; Miller, supra 1990; Markowatz, et al, supra 1988; 
Watanabe, et al, supra 1983; Danos, et al, PNAS, 85:6460-6464, 1988; and Bosselman, 
et al , Molecular and Cellular Biology, 7(5): 1 797- 1 806, 1 987. 

For example, in one embodiment, a modular assembly retroviral vector (MARV) 
1 0 can be utilized to express the invention chimeric protein and an antibiotic resistance 

gene. A "covector" (referred to herein as MARSHA) can be utilized to provide a nucleic 
acid construct comprising the promoter, the regulatory element and exogenous nucleic 
acid, and a second antibiotic resistance gene. The MARSHA vector carrying exogenous 
nucleic acid also has LTRs modified to promote high-level expression only in the 
15 presence of the invention chimeric protein encoded by MARV and exogenous Ugand 
therefor. Co-infected primary mammalian cells can then be selected using both 
antibiotics, resulting in a cell population that is dependent on ligand for high-level 
expression of the exogenous nucleic acid. 

By introducing all of the necessary regulatory machinery, plus exogenous 
20 nucleic acid, selectable markers, and nucleic acid encoding invention chimeric protein, 
e.g., into a MARV retrovirus, highly efficient insertion of exogenous nucleic acids into 
targeted cells can be achieved. 

Thus, the above-described viral constructs address several important problems 
confronted in the use of retroviruses in application of therapeutic gene transfer strategies 
25 to a variety of human diseases. For example, the retroviral vectors of the invention are 
capable of prolonged gene expression under conditions where conventionally integrated 
retroviruses are no longer transcriptionally active. 

To illustrate the invention chimeric protein FDs, EcR was used as the 
steroid/thyroid hormone nuclear receptor and multiple examples using either RXR or 



48 



Usp as the dimer partner were constructed, with either the EcR or the dimer partner 
positioned at the amino terminus. The DNA binding, transactivation, and 
dimerization properties of these several FD variants were compared with the 
properties of native receptor complexes. The size of the ENU and ENR FDs prepared 
5 was in the range from about 135 to about 145 kD; whereas E alone had a size of 94 
kD, as shown by Western blot analysis. 

EcR-Usp and EcR-RXR FDs efficiently bind EcREs 

The FDs were first examined for their ability to bind to target EcREs in 
response to ligand. FD proteins were extracted from transiently transfected hximan 

10 293 cells that were either untreated or treated with 1 f^M murA for 40 hours. To 

eliminate the possibility that certain of the FD constructs were translated with greater 
efficiency than others, which would lead to a false appearance of higher fimctional 
binding in comparative luciferase expression tests, P-galactosidase expression of 
internal control plasmids cotransfected with FD constructs was performed. These 

15 tests indicated no significant differences in the transfection efficiency of individual 
FD constructs, indicating that the DNA binding differences observed for different FDs 
reflect intermolecular properties of the FDs themselves, not expression level. 
Accordingly, all reactions were normalized to an internal |3-galactosidase control and 
total protein using the anti-EcR monoclonal antibody DDA2.7 (Koelle et al, supra, 

20 1 991). 

FD constructs with either 0 or 5 linker segments were assayed for their ability 
to bind labeled EcREs, as shown in Figure 2B. A prominent band co-migrating with 
band shifts for the separate dimer complexes (E+U and E+R) was observed in many 
lanes and indicated that some of the FDs formed fimctional DNA-binding internal 
25 dimers, referred to herein as "endodimers." For example, ROE, which is analogous to 
UOE, except for the substitution of U for R as the dimer partner, formed a clear 
endodimer band that was increased 4-fold by ligand, and demonstrated imequivocally 
a response to hormone. R5E, with a longer linker, had slightly decreased basal, and 
slightly increased ligand-stimulated, EcRE binding (5-fold) compared to that of ROE. 
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Constructs in which E is positioned at the N-terminus fonn endodimers and 
bind the EcRE probe an average of 10 times better than UNE constructs (Figure 2A). 
In addition, ENU constructs bind probe 80-150% more readily than even those other 
FDs with high-level EcRE binding, such as E5R, but display nearly complete 
5 insensitivity to ligand for formation of DNA-binding complexes. This effect was 
found to be substantially independent of linker length. 

ENR FDs also demonstrate a greater affmity for the EcRE probe than do RNE 
constructs. However, unlike ENU constructs, ENR FDs have a high degree of 
dependence upon the presence of ligand for formation of endodimers. 

The observation that ENU and ENR FDs bound probe better than either the 
RNE construct or the UNE construct prompted examination of whether the large 220 
amino acid F domain of E (not found on R or U) accounted for this effect. Further 
EcRE probe binding studies performed utilizing several FDs having in-frame 
incremental deletions of EOR to Nhel, PvuII, Narl, and Bglll sites within the 
ecdysone receptor F domain (Figure 2B) showed that the deletions had a minimal 
effect on either response to ligand, or on binding to the EcRE probe. Only EOR- 
ABglll, in which the extreme C-terminal end of the hormone binding domain is 
removed, displayed significant loss of the shifted band. These results suggest that 
flexibility within the long EcR F domain is not the primary determinant of improved 
DNA binding by FDs containing a EcR at the N terminal. This was presumed to be 
the result of a perturbed ligand binding pocket as opposed to decreased flexibility of 
the dimer partners joined by a linker. 

EcR-RXR and EcR-Usp FD Transactivation in Response to Ligand 

The results shown in Figure 2 clearly indicate that the EcR-RXR and EcR-Usp 
25 FD chimeric proteins could both respond to hormone and interact with the response 
element for EcR; however, these tests provided no indication regarding the function 
of these proteins to transactivate responsive promoters and induce gene expression. 
To determine the ability of the FDs to transactivate responsive promoters and induce 
gene expression, the FDs and an EcRE-luciferase reporter plasmid were co- 
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transfected into 293 cells. The results of these studies revealed that the activity of the 
FDs was variably diminished compared to that of monomeric receptors. 

It has also been discovered that the invention ENU FDs are constitutive 
repressors of transcription of a gene under the control of the corresponding 
5 steroid/thyroid hormone nuclear receptor's response element. For example, UNE 
constructs did not efficiently bind target EcREs (Figure 2), and predictably did not 
have a dramatic influence on expression of the E4-luc reporter plasmid. ENUs, on the 
other hand, although appearing to readily bind EcREs, were also unable to stimulate 
luciferase expression. To further confirm whether ENU proteins could bind target 

1 0 response elements, but had lost the capacity to transactivate, the ability of EOU to 
competitively block ligand-stimulated luciferase expression by monomeric receptors 
was tested. EOU elicited a dose-dependent inhibition of VE with endogenous dimer 
partner, whereas UOE had virtually no inhibitory influence (Figure 5 A). At the lowest 
ratio tested, a 1:20 ratio of EOU to VE decreased stimulated expression of VE by 

15 20%, while equimolar EOU blocked 80% of the response to ligand. At any 

concentration tested, UOE exhibited no suppressive effect on VE-mediated activation, 
and m fact, appeared to increase the stimulated level of expression by about 5% to 
about 1 5%. EOU had a similar influence on E without VP 16 fiision (Figure 5B), and a 
lesser but measurable inhibitory effect on VE combined with separate exogenous Usp 

20 (Figure 5C). The 50% inhibition of the VE+U basal transactivation level may suggest 
that EOU binds the target EcRE about as well as complexes of the separate receptors. 
The plateau of the EOU suppressive effect at a 1 :1 ratio v^th both E and E+U and the 
slightly increased stimulation at a 5:1 ratio of EOU to separate receptors indicate that, 
at higher concentrations, EOU can weakly transactivate in response to added ligand. 

25 In the experiments described herein utilizing FDs containing various 

combinations of DEcR and BEcR with either RXR or Usp, the length of the linker 
was not observed to have a significant effect on any of the functions of the FDs. 
Surprisingly, even shortening of the large F-domain of Drosophilia melanogaster EcR 
had little impact on ligand-responsive DNA binding of EcR FDs. Without restriction 

30 of the scope of the invention by any theoretical speculations, two possible 
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explanations of this observation are offered. One possibility is that there is enough 
flexibility within the structure of the individual receptors that any deformation 
necessary to allow appropriate dimerization can be tolerated while preserving, in 
some cases, nearly complete activity. The second possibility is that the C-terminus of 
5 the 5 ' receptor naturally lies close in spatial proximity to the N-terminus of the 3 ' 
receptor such that the distance is easily spanned. In the absence of detailed structural 
data of intact nuclear hormone receptor dimers, neither explanation can be ruled out. 

In the presence of ligand (i.e., MurA), RXR-containing FDs functioned 
similarly to monomeric ECR+RXR, but at approximately one-half of the maximum 

10 level of transactivation (for E5R). One likely explanation for this is that the FD 

constructs are expressed or translated less efficiently than the monomeric receptors as 
suggested by comparing the intensity of FD lanes to the E lane in a Western blot 
analysis. Although the level of absolute transactivation was halved, the relative 
induction of transcription by the ENR constructs, in particular, exceeded the relative 

1 5 induction of monomeric EcR and RXR. 

To further address the partial or complete loss of transcriptional activation for 
FDs relative to the separate proteins, the powerful VP 16 transactivating domain was 
coupled to the amino-termini of FDs or separate EcR to determine if this addition 
could restore lost transactivational capacity. While a linker containing 5 linker 
20 segments (65 amino acids) was sufficient to allow good DNA binding, it may not 
have allowed sufficient freedom of movement for other domains, including those 
responsible for transactivation, to fold or orient as they do in the native proteins. 
Length of the linkers in VE5R and VE5U was therefore increased in increments of 5 
linker segments each to 10 and ultimately 20 linker segments (240 + 5 amino acids). 

25 Addition of the VPl 6 activation domain to the amino terminus of the ENR 

FDs restored the full transactivation potential of the FDs relative to separate VP16- 
flised EcR, suggesting that addition of a strong transactivating domain overrides 
translational deficits caused by incorporating the receptor/dimer partner into a 
chimeric protein. However, VP16-ENU (VENU) chimeric proteins never exceeded 

30 30% of the absolute level of transactivation of separate VE+U. One possible 
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explanation is that some conformation constraint in the Usp half of the VENU 
constructs prevented the interaction of VENU with endogenous cofactors necessary 
for full high level expression. A second possibility is that increased spontaneous 
heterodimerization within ENU FDs results in a conformational alteration that 
5 decreases or blocks ligand binding, assuming that ligand plays a direct role in 
transactivation and not just dimerization. 

High level VENR transactivation suggests that the very low level expression 
observed from unliganded ENR constructs in Figure 3 was due to transcriptional 
repression by bound ENR proteins. The addition of VP 16 increased the basal VENR 

10 FD expression over 5-fold in comparison to separate VECR+RXR. This presimiably 
reflected an increased level of spontaneous dimer formation of ENR FDs resulting 
from the forced proximity of the separate components by the linker. This 
phenomenon may not have been readily evident in the gel shift experiments as the 
result of the short transient period of interaction of the proteins with the DNA probes 

1 5 (<3 0 minutes) compared to the duration with the responsive promoters in the transient 
transfection experiments (>30 hours). 

Heterodimer formation and DNA binding of FDs 

Five different classes of interaction of protein units in the invention chimeric 
protein(s) (Figure 7) were predicted: 1) "disorganized", indicative of non-interaction 

20 by the individual receptor components; 2) "endodimer", indicating formation of 
functional dimers approximating the native heterodimer complex; 3) "trimer", 
indicating interaction with a separate monomeric partner, 4) "tetramer", indicative of 
two mutually cross-interacting chimeric proteins ; or 5) "oligomers", representing 
chimeric proteins chain-interacting with each other. The data presented herein 

25 provides clear evidence that disorganized, endodimer, and trimer species were formed 
when the invention chimeric proteins contained two protein units (FDs), with 
endodimer formation predominating. Only the UNE constructs appeared to be largely 
disorganized, as evidenced by their apparent inabiUty even to bind DNA with high 
affinity. All remaining FD classes showed abundant evidence of endodimer 

30 formation, although RNE constructs were noticeably weaker than constructs with EcR 
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in the amino terminal position. The transient-transfection competition experiment 
(Figure 4) indirectly indicates that a high affinity monomeric dimer partner, such as 
VUsp, can displace a weaker intramolecular RXR dimer partner to form a partially 
functional trimer species; however, such a monomeric, high affinity dimer partner 
5 was much less capable of displacing an intramolecular Usp under the same 

circumstances. The lower affinity VRXR monomeric dimer partner was vinable to 
displace to any significant degree either Usp or RXR as an intramolecular dimer 
partner in an invention FD fusion protein. 

Evidence of formation of higher order constructs, such as tetramers or 
10 oligomers, in gel shift assays is scant, but may be suggested in lanes of the gel shift 
assay with UNE and RNE constructs. Although band shifts were weak, UNE (and 
unliganded RNE) constructs had slightly intensified bands of higher molecular weight 
than the corresponding size of the endodimer bands. The faint but detectable high 
molecular weight shift bands observed for FDs in some lanes of the assays suggest 
15 tetramer and oligomer formation, presiamably through cross interaction of the 
ecdysone receptor component of one FD with the dimer partner unit of another. 
These results, coupled v^th the results of competition experiments using 
superphysiological levels of competing dimer partner, suggest, in any event, that 
multimerization is relatively rare and is likely to occur, at even low levels, only with 
20 those FDs that have decreased capacity for endodimer formation. 

These data fiirther support the supposition that proximity to dimer partner (i.e., 
as in invention chimeric protein(s)) not only limits dimer partner preference, but also 
increases the ease of dimer formation and DNA binding for some of the fiision 
constructs relative to monomeric receptors. For example, ENU constructs displayed 

25 high-level complex formation with the EcRE probe (a 1 . 1- and 0.9-fold increase for 
EU and E5U, respectively) even in the absence of ligand, while separate EcR and Usp 
required ligand for maximal complex formation. ENR constructs, on the other hand, 
still retained much of their original ligand dependence, indicating that dimer partner 
proximity is not the sole, or perhaps even most important, determinant of dimer 

30 formation. 
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The degree to which FDs interact with external receptors to form a trimer 
complex was indirectly examined in the studies showing their interaction with high 
levels of competing VP16-fiision dimer partners and the resulting effect on 
transactivation of the E4-luc reporter. In these studies, monomeric VRXR was unable 
5 to enter the FD complex of any construct, suggesting that the EcR component of FDs 
much prefers a linked dimer partner of either high or low affinity to a separate low 
affinity dimer partner. VUsp, on the other hand, had a comparatively smaller effect 
on transactivation induced by the E5U construct, than on E5R, indicating that the EcR 
protein in E5U preferentially dimerizes with the linked Usp, while the RXR dimer 
1 0 partner of E5R may enter a complex with monomeric VUsp. 

In summary, the results of the studies described herein indicate that selected 
chimeric proteins of steroid/thyroid hormone nuclear receptors with appropriate dimer 
partners can retain most of the primary characteristics of the native complex: binding 
of ligand, recognition and binding of cognate response elements, and, in some cases, 

15 ligand-stimulated transactivation of responsive promoters. Subsets of the constructs 
prepared displayed varying degrees of these characteristics. The RAJNE proteins 
characteristically exhibited low DNA binding and transactivation capacity while the 
ENRAJ proteins uniformly demonstrated wild-type or superior EcRE binding and 
variable capacity to transactivate, resulting in properties ranging from constitutive 

20 repression to essentially wildtype, ligand-responsive transactivation. 

The invention herein provides the advantage that, for many studies in cultured 
cells or transgenic animals, invention FDs will allow the examination of specific 
heterodimer pairs Avith much decreased potential for contamination with exterior 
dimer partners, such as those endogenously produced in the test cell or animal. This 
25 may be of particular assistance in examining the function of specific RXR subtype 
combinations, or even for further studying the potential for Usp-Usp interactions. 

The invention herein provides the further advantage that specific heterodimer 
pairs can be examined by their introduction into the system as a single chimeric 
protein, e.g., as a fusion protein, rather than by separate introduction of two 
30 constructs. 
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In addition, studies described in the Examples herein indicate that many of the 
combinations may have unique properties of ligand independence or repression that 
may have significance to their application for therapeutic purposes. For example, 
certain of the invention chimeric proteins that transactivate gene expression may be 
5 useful as a "gene switch," for modulating expression of an exogenous gene in a 

mammalian system or in plants, fungi and other non-mammaUan species. When the 
FDs transactivate the response element-containing promoter (e.g., in the presence of 
ligand), the exogenous gene is switched on; when the FDs repress the promoter, the 
exogenous gene is switched off. 

10 In nature, dimers of nuclear hormone receptors are unstable and, hence, are 

not useful in x-ray crystallography studies to determine structure. Because of the 
demonstrated stability of the invention FD heterodimers, they may be advantageously 
used in the preparation of crystals for x-ray diffraction studies for use in rational 
design of ligands to develop new steroids, insecticides, steroid antagonists, and the 

15 like, as described hereinbelow. Crystal structure may also permit deduction of the 
structure of ligands for orphan receptors. It is also contemplated that such crystals of 
the invention FDs can be used for preparation of antibodies that react with the 
heterodimers using methods known in the art. 

In accordance with another embodiment of the present invention, there are 
20 provided isolated protein crystals suitable for x-ray diffraction analysis of a purified 
invention chimeric protein. In alternative embodiments, the crystal may be obtained 
of a ligand bovmd to a purified chimeric protein so as to form a chimeric protein- 
ligand complex, or a crystal may be obtained of a putative response element bound to 
purified fusion protein or fusion protein ligand complex as described herein. The 
25 invention additionally contemplates a set of x-ray diffraction crystal coordinates 
obtained by x-ray diffraction of any such invention isolated protein crystals. 

A variety of methods are known in the art for purifying proteins and obtaining 
crystals of the purified proteins, for example growing crystals in microgravity and/or 
by vapor diffusion (D.R. Davies and D. M. Segal, Meth. Enzymol. 22:266, 1971). 
30 Crystals of purified proteins can also be obtained commercially. To aid in the 
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purification of the invention fusion proteins, it is recommended to add a His tag to the 
amino terminus of the fusion protein, as is known in the art and described in Example 
6 herein. Addition of such a His tag does not interfere with dimerization of the 
invention fusion proteins. 

5 In accordance with still another embodiment of the present invention, there are 

provided methods for identifying potential ligand(s) for member(s) of the 
steroid/thyroid hormone nuclear receptor superfamily utilizing a set of atomic 
coordinates obtained by x-ray diffraction analysis of an invention purified protein 
crystal. The invention assay method comprises creating a three-dimensional structure 

10 of a chimeric protein formed into a functional entity (i.e., by dimerization of 
dimerization domains contained therein) as defined by the atomic coordinates 
obtained by x-ray diffraction studies, employing the three-dimensional structure to 
design or select the potential ligand; synthesizing the potential ligand; and then 
contacting the potential ligand with an invention fimctional entity in the presence of a 

15 response element operatively linked to a marker protein under conditions suitable for 
causing expression of the marker protein to determine the ability of the potential 
ligand to transactivate expression of the marker protein. The potential ligand can be 
designed de novo or designed from a ligand. 

Methods for obtaining a set of atomic coordinates of a protein crystal using x- 
20 ray diffraction and for creating a three-dimensional model of a protein from such a set 
of atomic coordinates are known in the art. Such procedures are disclosed, for 
example, in U. S. Patent No. 5,856,1 16, which is incorporated herein by reference in 
its entirety. For example, x-ray data sets can be collected on a R-axis IIC image plate 
system and/or on a 2.2.A Synchrotron data set for refinement of the three-dimensional 
25 structure (i.e., the model). Then, the data can be collected at Cornell High Energy 
Synchrotron Source ("CHESS") on a charge-couple device and reduced to structure 
factor amplitudes using the Denzo Software Package (Denzo— An Oscillation Data 
Processing Program For Macro Molecular Crystallography, ©1993, Daniel Gewirth, 
Yale University). Oscillation photographs can be integrated and reduced to structure 
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factor amplitudes using software supplied by the manufacturer (Molecular Structures 
Corp., Dallas, Tex.). 

Refined heavy atom parameters can be used to compute multiple isomorphous 
replacement phases. Solvent flattening and phase extension (CCP4-Collaborative 
5 Computing Project No. 4, A Suite of Programs for Protein Crystallography; 

Daresbury Laboratory, Warrington, WA4 4AD, U.K. (1979)) can be used to improve 
the map and allow identification of some of the residues in the protein core. Cycles of 
model building (Quanta, version 4.0b, Molecular Simulations Inc., Burlington Mass.), 
positional refinement, (Brunger, A. T., J. Acta Cryst., A46: 46-57. 1990); Brunger, A. 
10 T. et al., J. Acta Cryst., A46:585-93, 1 990) and phase combmation (CCP4- 

Collaborative Computing Project, supra) can be carried out imtil the switch to phases 
calculated from the model can be made. Refinement against -16 °C., 2.2.A data can be 
continued to allow the more difficult loop regions of the protein to be constructed. 

The invention will now be described in greater detail by reference to the 
15 following non-limiting examples. 

EXAMPLE 1 

Design of ecdysone receptor-Usp/RXR functional dimers 

Two classes of chimeric proteins were constructed as fusion proteins to study 
the activity of EcR-Usp/RXR functional dimers. In one class, EcR is at the N- 
20 terminus of a fusion protein, and in the other class the bmding partner (either Usp or 
RXR) is at the N-terminus (Figure 1). To facilitate formation of the fimctional dimers 
and allow for insertion of polypeptide linkers between the receptor and its binding 
partner in the fusion protein, a 5 amino acid bridge that also encodes the restriction 
endonuclease site for Sfil was inserted between the two open reading frames (ORFs). 

25 A double stranded Sfil compatible oligonucleotide encoding the amino acid 

sequence GPGGGSGGGSGT (SEQ ID NO: 17) was designed to provide a high 
degree of predicted flexibiUty while attempting to minimize repetitive sequence 
within the oligonucleotide. This nucleotide sequence incorporated the Sfil site at the 
5' end of the insert to allow for ease in increasing the number of linker segments 
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within a previously existing construct. By phosphorylating the 36-base-pair double 
stranded oligonucleotides and ligating them into Sfil digested FD plasmid templates, 
FDs were produced with peptide linkers of variable length that increased by 12 amino 
acid increments. 

5 Construction of fusion proteins containing the ecdysone receptor. 

Figure 1 shows the schematically fusion protein functional dimer constructs 
R/U(N)E and ECN)^^- Construction of the invention fusion proteins began with 
modification of the N and C termini of human RXRa, dmUsp and dmEcR ORFs 
subcloned into the cloning vector SK-NBN (pBSK with a modified polylinker). An 
10 Sfil site was inserted at either end of each receptor, in-fi-ame, by PCR mutagenesis. 

For the hRXR N-terminal Sfil site, the primer in the 5' direction was 
GTAGAATTCGGCCAACAGGGCCCATGGACACCAAACATTTC (SEQ ID 
NO: 18); and the primer in the 3' direction was GATGGGGGAGCTCAGGGTGC 
(SEQ ID NO: 19). 

15 For the C-terminal Sfil site, the primer in the 5' direction was 

GGAGAGCTCGAGGCCTACTGCA (SEQ ID NO:20); and the primer in the 3' 
direction was ACCATCGATTCAGGGCCCTGTTGGCCCGTGCGGCGCCTC (SEQ 
ID N0:21). 

For the dmusp N terminal Sfil site, the primer in the 5' direction was 
20 GTAGAATTCGGCCAACAGGGCCCATGGACAACTGCGACCAG (SEQ ID 
NO: 22); and the primer in the 3' direction was CAGCACGTGGACCATTGACA 
(SEQ ID NO:23). 

For the C-terminal Sfil site, the primer in the 5' direction was 
GGAGAGCTCTTTCTCGAGCAGCTG (SEQ ID NO:24); and the primer in the 3' 
25 direction was 

ACCATCGATTCAGGGCCCTGTTGGCCCCTCCAGTTTCATCGCCAGGCCG 
(SEQ ID NO:25). 
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For the ecdysone receptor N-terminal Sfil site, VP 16 sequences were fused in 
frame to the Ncol site approximately 200 base pairs into the ecdysone receptor ORF, 
creating an Sfil site at the VP16-ecdysone receptor boundary. 

For the VP 16 insertion site, the primer in the 5' direction was 
5 CATAAGCTTATGGGACAGACACTGATGGGACGGCCC (SEQ ID NO:26) and 
the primer in the 3' direction was 

CAGAGACCATGGGCCCTGTTGGCCCCCCACC (SEQ ID NO:27). 

For the ecdysone receptor C-terminus insertion site, the primer in the 5' 
direction was TTACCGCTAGCTCCACCA (SEQ ID NO:28); and the primer in the 
10 3 ' direction was GTAGATATCAGGGCCCTGTTGGCCCAGTCGTCGAGT (SEQ 
ID NO:29). All primer sequences are written 5' to 3'. 

For VP16 (S.J. Triezenberg et al. Genes Dev., 2:718-729, 1988) fusion to 
RXR and Usp, the VP 16 sequence region was removed from VE using the Sfil site at 
the 3' boundary of VP 16 sequence for fusion of the 260 base pair VP 16 fragment into 
the N-terminal compatible Sfil site of previously modified RXR and Usp ORFs. All 
fusion receptor variants were originally produced by insertion of both ORFs at the 
central Sfil site. Linker segments with Sfil compatible overhangs were produced by 
aimealing two linker-encoding oligonucleotides having the sequence 
GGGCCAGGAGGTGGCTCCGGGGGAGGTTCAGGCACA (SEQ ID NO:30) in 
the 5' direction, and the sequence 

GCCTGAACCTCCCCCGGAGCCACCTCCTGGCCCTGT (SEQ ID NO:31) in the 
3' direction. 

EcR F-domain deletion constructs were produced by inserting an in-frame 
polylinker upstream of the Sfil -N-terminal modified RXR for reception of compatible 
25 F-domain deleted ecdysone receptor fragments. The polylinker in the 5 ' direction, 
AAGCTTGAGAGATCTGGGACGGCGCCCCCGGGGCTAGCGGGCCAACA 
(SEQ ID NO:32) encoded (fi-om Bgl II) the peptide sequence IWDGAPGAS (SEQ ID 
NO:33) and restriction sites Hind III-Bgl II-Nar I-Sma I-Nhe I with an Sfil 
compatible 3' end. Hind III-Bgl II, Hind III-Nar I, etc. fragments of the ecdysone 
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receptor were inserted into this polylinker for fusion of F-domain deletions to RXR. 
Figure 2B shows schematically the F-domain deletion constructs of EOR made by this 
procedure. 

PGR reactions for production of receptor mutants were performed using 100 
5 ng plasmid template, 500 ng of each primer, and reaction conditions outlined by the 
manufacturer for Pwo (Boerhinger Mannheim) high-fidelity polymerase. A program 
of 1 min. 94°C/1 min. 45°C/1 min. 72°C/1 min. for 20 cycles was used for production 
of all PGR products used. For constructs containing multiple repeat linker segments, 
fusion receptors were Hnearized by Sfil digest, and linker segment oligonucleotides, 

10 kinased to allow multiple tandem insertions, were ligated into the site by standard 
methods. Inserted linker segment repeats of between 0 and 5 linker segments were 
found by restriction endonuclease digest followed by sizing on 3% agarose gels. For 
the studies reported here, the minimum linker length contained only the 5 amino acid 
fusion bridge (signified herein by linker segment designation "0") and the maximum 

15 was 245 amino acids (including a five amino acid fusion bridge) (signified herein by 
linker segment designation "20"). 

Plasmids from clones of interest were prepared on a large scale for use in 
transfection and other analysis including confirmatory sequencing of constructs. All 
receptors were subcloned into vector LNGX (A.D. Miller, GenBank Acc. No. 
20 M28247) (with an extended polylinker) for use in transfection. 

EXAMPLE 2 
Transfection of FD constructs. 

For quantitative transactivation analysis, transfections were performed in 
triplicate in 24-well plates by calcium-phosphate co-precipitation with 100 ng of an 
25 individual receptor, the reporter plasmid E4-luc, and pGHl 10 (SV40-P-galactosidase) 
as an internal control. Briefly, the reporter plasmid, E4-luc, was constructed of 4- 
tandem EcREs inserted upstream of a thymidine kinase gene minimal promoter 
directing luciferase expression. EcRE oligonucleotides were as described by Thomas 
et al., supra (1993) with BamHI/Bglll compatible ends. 1 jiM ligand was added at the 
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time of transfection, and the cells were harvested for luciferase assay 40 hours later. 
Harvested cell extracts were split, and one part was analyzed in a luminometer for 
luciferase activity and the other part was analyzed for P-galactosidase activity using 
an orthonitrophenyl galactoside assay by standard methods. Luciferase levels were 
5 normalized to P-galactosidase values to correct for slight differences in transfection 
efficiency. 

Preparation of the invention fusion protein FDs for gel shift analysis by 
transient transfection into 293 cells was as follows: 300 ng of individual receptor 
plasmids and 100 ng pCHl 10 internal control plasmid were cotransfected into 293 

10 cells at 60% density in Costar 6-well plates. One well of each group was treated with 
1 |a,M murA as ligand at the time of transfection. 40 hours later, extracts of transfected 
293 cells were made by scraping the cells from a well into a low volume of phosphate 
buffered saline, pelleting the cells, resuspending them in 200 p,l 5X gel shift buffer 
(Yao et al, supra 1992), and sonicating with a Kontes cell disrupter for three 10-sec. 

15 bursts at output level 30. The extracts were then centrifuged to clear the lysate and 
the protein was quantified. Extract volumes were adjusted with buffer to a 
concentration of 1 mg/ml and frozen at -70°C until use. P-galactosidase activity was 
assayed as above to determine relative transfection efficiency of each well. 

EXAMPLE 3 

20 Gel mobility shift analysis. 

Comparative gel mobility shift analyses of FDs with control in vitro translated 
receptor complexes were performed using double stranded EcRE probes and labeled 
by Klenow fill using ^'^P-dCTP and cold dGAT by standard methods. In vitro 
translated proteins used as controls were produced using the T3/T7 TNT (Promega) 

25 transcription/translation system following the manufacturer's protocol. In vitro 

translated proteins were qualitatively examined by 5% SDS-PAGE using protocols as 
described (J. Sambrook et al. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring Harbor, New York, 1989) to ensure the presence of full- 
length products of the proper molecular weight. Reaction conditions for protein- 

30 probe interaction and gel electrophoresis were essentially identical to those disclosed 
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in Yao et al, supra 1992. However, to improve comparison between samples, 
reaction mixtures (including dimer partners and probe) were prepared as a cocktail 
and distributed equally to individual tubes with receptor proteins. P-galactosidase 
assay indicated that all samples were essentially equivalent, so equal volumes (10 jil) 
5 of extract were used in each reaction in a final reaction volume of 30 The 
reactions were allowed to proceed at 23 °C for 5 minutes at which time ligand or 
vehicle was added and the reaction allowed to continue for 20 additional minutes. 
Band volumes were quantified using laser scanning densitometry. 

Western blot analysis 

1 0 6-well plates of 293 cells transfected with 2 fxg of receptor construct and 1 00 

ng of pCHl 10 were harvested in PBS and lysed by three rounds of freezing and 
thawing. P-galactosidase assay of lysates indicated equivalent transfection efficiency 
for all constructs so equivalent protein (7.5 |j,g) was loaded and run on a 12.5% SDS- 
PAGE gel and transferred to nitrocellulose by standard methods (Sambrook et al, 

1 5 supra 1 989). The transferred filter was incubated v^th the anti-EcR monoclonal 

antibody DDA2.7 (Koelle et al, supra 1991) at a 1/1000 dilution at 4°C for 48 hoxirs. 
After washes, anti-mouse IgG (1/5000) was added for 1 hour, washed away, and the 
blot processed for chemiluminescence and exposed to film. 

An autoradiogram of FDs and controls treated either with control vehicle or 
20 with 1 i^M of mvirA as ligand was made with markers indicating endodimer FD or 

wild type receptor-binding complex band shifts. E+U and E+R were control lanes of 
in vitro translated proteins for sizing of endodimer band shifts. Figure 2A is a graph 
that quantifies endodimer-sized band volumes obtained from the autoradiogram. 

A prominent band co-migrating with band shifts for the separate dimer 
25 complexes (E+U and E+R) was observed in many lanes and indicated that some of the 
FDs formed functional DNA-binding internal dimers that we designate "endodimers." 
UOE displayed a barely detectable endodimer band that was perceptibly increased (2- 
fold) in intensity by the presence of ligand (Figure 2B). The addition of 5 linker 
segments in U5E appeared to amplify the overall intensity of the shift, but still 
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displayed minimal response to ligand. UNE constructs displayed faint bands above 
the weak endodimer band-shift that were equally as intense. These bands were 
occasionally visible but were proportionally less intense than the endodimer band 
shift of other FDs described below. ROE, which is analogous to UOE but with 
5 substitution of U for R, formed a clear endodimer band that was increased 4-fold by 
ligand (Figure 2B), and unequivocally demonstrated FD responds to hormone. R5E, 
with a longer linker, had slightly decreased basal, and slightly increased ligand- 
stimulated, EcRE binding (5-fold) compared to ROE. The higher molecular weight 
shift bands observed in UNE lanes were not visible in ligand-treated R5E lanes 
10 (Figures 2 A). 

ENU constructs indicated that FDs in which E was positioned at the N- 
terminus formed endodimers and bound the EcRE probe an average of 10 times better 
than UNE constructs. ENU constructs bound probe 80-150% more readily than even 
other FDs with high-level EcRE binding (i.e., E5R, discussed below), but displayed 

15 nearly complete insensitivity to ligand for formation of DNA-binding complexes. 
Like UNE FDs, the longer linker length of E5U did not significantly increase the 
binding to EcRE or responsiveness to murA, relative to EOU (Figures 2A and 2B). 
ENR constructs, like ENUs, also demonstrated a greater affinity for the probe than the 
reversed constructs (Figure 2A). Unlike ENU constructs, ENR FDs had a high degree 

20 of ligand dependence for endodimer formation. E5R displayed a slightly decreased 
shift from the rate of basal transcription, and a slightly elevated ligand- stimulated 
shift (11 -fold relative induction) in comparison to EOR (7-fold), much like ROE and 
R5E. Non-transfected cells, or cells transfected only with E, displayed no detectable 
shift even after prolonged exposure. At concentrations of protein higher than those 

25 used in these gel-shift reactions, a shift of E in combination with endogenous RXR 
was observed. Separate experiments also confirmed that FDs specifically bound 
EcREs and not unrelated control probes that included thyroid hormone response 
elements. 

Figure 3 shows relative luciferase expression of FD constructs with or without 
30 1 |LiM murA. The results of these studies are summarized in Table 1 below and show 
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repression of monomeric receptors and monomeric dimer partners by EOU and UOE 
FDs. 

TABLE 1 





UOE 


ROE 


USE 


R5E 


EOU 


EOR 


E5U 


E5R 


E 


E+U 


-Lig/E4 


0.75 


0.18 


0.52 


0.25 


0.17 


0.21 


0.41 


0.46 


1.08 


1.74 


+Lig/E4 


0.55 


1.75 


0.69 


1.50 


0.40 


3.39 


0.62 


3.89 


7.72 


8.34 


Rel. 


0.7 


9.7 


2.7 


6.0 


2.4 


16.2 


1.6 


8.4 


7.1 


4.8 


Ind. 























5 -Lig/E4 = luciferase activity in cells transiently co-transfected with FDs and 

monomeric receptors vs. reporter only without ligand 

+Lig/E4 =luciferase activity in cells transiently co-transfected with FDs and 
monomeric receptors vs. reporter only with ligand 

Rel. Ind. = relative induction of individual receptor groups 

10 As shown in Table 1, UNE and ENU constructs either did not stimulate 

luciferase expression or appeared to actually function as repressors of basal 
transcription. EOU in the absence of ligand, for instance, reduced E4-luc transcription 
to only 17% of the basal expression level. RNE and ENR constructs, by comparison, 
fimctioned much more like separate E+R, although both the basal and induced levels 

15 were proportionally decreased relative to the monomeric receptors. As might be 

predicted from the mobility shift experiments, E5R provided the closest profile to the 
monomeric (i.e., wild type) separate receptors, having approximately 50% of the E+R 
induced expression level. By contrast, both EOR and E5R, however, had greater 
relative inductions than separate E+R (16.2- and 8.4-fold, respectively, versus 7.1- 

20 fold). 

EXAMPLE 4 

Addition of the potent VP 16 transactivation (t) domain to the N-terminus of 
FD constructs was used to further examine transactivation by FDs. To test the 
possibility that distortion of the endodimer by a short linker (0 linker segments) 



65 



contributed to inhibited transactivation, the number of linker segments in the linker 
between the units in the invention fusion proteins was expanded to a maximum of 20 
linker segments for ENR and ENU variants. As shown in Figure 4 A, addition of a 
VP 16 X domain to either the E monomer (VE) or to FDs (VENR or VENU) resulted 
5 in a 12 to 15-fold overall increase in luciferase expression. These VENR FDs 
containing linkers of variable length produce a stunulated level of transcription 
virtually identical to separate VE protein, suggesting that augmentation of ENR 
proteins with VP 16 x domain compensated for any loss in transactivation resulting 
from the fusion of the receptor and dimer partner into the invention fusion proteins. 

10 Notably, however, the level of basal, induced expression was increased 7 to 8-fold 
over monomeric receptors. This resulted in a dramatic decrease in the relative fold 
induction, from 27.8-fold for separate VE to 4.1 to 5.4-fold for FDs (Figure 4A). In 
addition, the increase from a 5 amino acid linker to a 245 amino acid linker had little 
effect on either basal or activated VENR transactivation with the exception of a subtle 

15 decrease in both levels in the FDs having the longer 125 (10 linker segments) and 245 
amino acid (20 linker segment) linkers. As shown in Figure 4B, VENU constructs 
displayed neither the basal nor induced level of expression of separate VE+U, even 
though the overall level of transactivation was significantly increased relative to E+U 
complexes without a heterologous transactivating domam. Like the VENR proteins, 

20 the addition of increased linker segments to the VENU constructs had a minimal 
effect on basal or induced transcriptional activation. 

EXAMPLE 5 

To elucidate the propensity of fusion protein partners to dimerize with each 
other over other monomeric suitable dimer partners, the influence of monomeric 

25 dimer partners on FD transactivation properties was assayed. In the transient 

transfection experiments shown in Figure 6, R and U proteins with N-terminal VP 16 
fusions (VR and VU, respectively) were utilized to probe E5U and E5R promiscuity. 
E5R and E5U constructs were used because previous experiments suggested that they 
displayed properties that were the most similar to monomeric (i.e., wild type) 

30 receptors. When both E5R and E5U constructs were cotransfected into 293 cells in 
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equimolar quantities along with the E4-luc reporter, VR cotransfection was not 
observed to significantly influence either E5U or E5R function, even though VR was 
found to augment E-mediated transactivation alone by >5-fold either with or vidthout 
ligand (Figure 6). VU, on the other hand, delectably interacted with E5R, and to a 
5 lesser extent with E5U. Ligand-dependent transactivation was observed for 

E5U+VUsp at approximately 10% of the ligand-stimulated level of VUsp v^th E, 
whereas E5R+VUsp activated to nearly 50% of the VUsp + E level. VR and VU 
alone had no influence on E4-luc expression either vidth or without murA. 

EXAMPLE 6 

10 Using a modification of the method for constructing invention fusion proteins 

described above in Example 1, fusion proteins were constructed having a Bombyx 
ecdysone receptor (BEcR) in the amino terminal half of the fusion protein, a linker 
bridge of 5 amino acids, and either RXR (BEOR) or Usp (BEOU) as the dimer partner 
placed at the C-terminal half of the fusion protein. To facilitate cloning, the BEcR 

15 amino acid sequence in each of these fusion proteins was augmented at the C-terminal 
end with amino acids 650-878 from the Drosophila melanogaster ecdysone receptor. 
Similar constructs were made wherein a His tag was positioned at the amino terminus 
of the fusion protein to facilitate purification of the fusion protein (HisBEOR and 
HisBEOU, respectively). 

20 Gel mobility shift assays using tebufenozide and MurA as ligand were 

conducted as described in Example 3 above to determine whether functional dimers 
formed from the BEOR and BEOU fusion proteins. The results of these assays showed 
the both BEOR and BEOU dimerize and constitutively bind target DNA irrespective of 
the presence of ligand. When the study was repeated using HisDEOR, HisBEOR and 

25 HisBEOU, it was determined that the His tag does not effect the ability of these fusion 
proteins to bind target DNA. 

It will be apparent to those skilled in the art that various changes may be made 
in the invention without departing from the spirit and scope thereof, and therefore, the 
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invention encompasses embodiments in addition to those specifically disclosed in the 
specification. 
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WHAT IS CLAIMED IS: 

1 . A chimeric protein comprising: 

at least two functional protein imits, wherein each functional protein 
unit comprises the dimerization domain of a member of the steroid/thyroid 
hormone nuclear receptor superfamily, and 

an optional linker interposed therebetween, 

wherein the at least two protein vmits form a functional entity. 

2. The chimeric protein according to claim 1 wherein the entity is an 
endodimer. 

3. The chimeric protein according to claim 1 wherein each protein imit 
comprises a ligand binding domain, an optional hinge domain, and an optional 
DNA binding domain. 

4. The chimeric protein according to claim 3 wherein the functional entity 
is an endodimer. 

5. The chimeric protein according to claim 1 wherein at least one member 
is non-mammalian. 

6. The chimeric protein according to claim 5 wherein the at least one 
member is from an insect species. 

7. The chimeric protein according to claim 1 wherein at least one 
functional protein unit comprises the dimerization domain of an ecdysone receptor. 

8. The chimeric protein according to claim 7 wherein the ecdysone 
receptor comprises the dimerization domain of a. Drosophila ecdysone receptor. 
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9. The chimeric protein according to claim 7 wherein the ecdysone 
receptor comprises the dimerization domain of a Lepidoptera ecdysone receptor. 

1 0. The chimeric protein according to claim 7 wherein the ecdysone 
receptor comprises the dimerization domain of a Bombyx ecdysone receptor. 

1 1 . The chimeric protein according to claim 5 wherein at least one 
fimctional protein unit comprises the dimerization domain of the ultraspiracle protein. 

12. The chimeric protein according to claim 1 wherein at least one member 
is non-mammalian. 

13. The chimeric protein according to claim 1 wherein at least one 
functional protein imit comprises the dimerization domain of the retinoid X receptor. 

14. The chimeric protein according to claim 1 wherein the protein units are 
independently selected from the group consisting of glucocorticoid receptors, 
mineralocorticoid receptors, estrogen receptors, progesterone receptors, androgen 
receptors. Vitamin D3 receptors, retinoic acid receptors, retinoid X receptors, 
peroxisome proliferator-activated receptors, thyroid hormone receptors, and steroid 
and xenobiotic receptors, famesoid X receptor, pregnenolone X receptor, liver X 
receptor, and BXR. 

15. The chimeric protein according to claim 1 wherein the linker contains 
from about 5 to about 245 amino acids. 

16. The chimeric protein according to claim 15 wherein the linker contains 
from about 53 to about 125 amino acids. 

17. The chimeric protein according to claim 15 wherein the linker 
comprises glycine, proline, serine, alanine and threonine residues. 
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1 8. The chimeric protein according to claim 1 5 wherein the chimeric 
protein is a chimeric protein and the linker comprises the amino acid sequence of SEQ 
ID NO: 15. 

19. The chimeric protein according to claim 3 wherein one or more protein 
units further comprise a C-terminal domain. 

20. The chimeric protein according to claim 3 wherein the DNA binding 
domains of one or more protein units comprise 66 to 68 amino acids, including 9 
cysteines. 

2 1 . The chimeric protein according to claim 3 wherein the hinge domain of 
one or more protein units is the Bombyx hinge domain. 

22. The chimeric protein according to claim 1 wherein one or more protein 
units further comprise an activation domain. 

23. A polynucleotide encoding a chimeric protein according to claim 1 . 

24. A cell containing a polynucleotide according to claim 23. 

25. The cell according to claim 24 wherein the cell is mammalian. 

26. A method for modulating the expression of an exogenous gene in a 
subject organism containing: 

1) a chimeric protein according to claim 1, and 

2) a DNA construct comprising the exogenous gene vmder the 
control of a response element with which the chimeric protein interacts, 

said method comprising administering to the subject an effective 
amount of an exogenous ligand for at least one functional unit of the chimeric 
protein. 
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27. The method according to claim 26 wherein the chimeric protein is 
encoded by a DNA construct. 

28. The method according to claim 26 wherein the subject organism is a 
plant, an animal, a fungus or a bacterium. 

29. The method according to claim 26 wherein the subject organism is 
mammalian. 

30. The method according to claim 28 wherein at least one of the protein 
units is non-mammalian. 

3 1 . The method according to claim 29 wherein at least one of the protein 
imits is from an insect species. 

32. The method according to claim 26 wherein the modulation is ligand- 
dependent repression. 

33. A method for modulating the expression of an exogenous gene in a 
subject organism containing a DNA construct comprising the exogenous gene under 
the control of a response element, 

said method comprising administering to the subject an effective 
amount of a chimeric protein according to claim 1, 

wherein the modulation is independent of ligand. 

34. The method according to claim 33 wherein the modulation is ligand 
independent activation. 

35. The method according to claim 33 wherein the modulation is ligand 
independent repression. 
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36. A method for modulating the expression of an gene in a subject 
organism containing a chimeric protein according to claim 1, 

said method comprising introducing to the subject an effective amount 
of a DNA construct comprising the gene under the control of a response 
5 element, 

wherein the response element is responsive to the chimeric protein and 
wherein the modulation is independent of ligand for the chimeric protein. 

37. The method according to claim 36 wherein the gene is exogenous. 

38. The method according to claim 36 wherein the modulation is ligand 
independent activation. 

39. The method according to claim 36 wherein the modulation is ligand 
independent repression. 

40. A method for modulating the expression of an exogenous gene in a cell 
containing: 

1) a chimeric protein according to claim 1 and 

2) a DNA construct comprising the exogenous gene imder the 
5 control of a response element with which the chimeric protein interacts, 

wherein said response element controls expression of the exogenous gene, 

said method comprising administering to the cell an effective amomt 
of an exogenous ligand for at least one functional unit of the chimeric protein. 

41 . The method according to claim 40 wherein the modulation is ligand 
independent activation. 

42. The method according to claim 40 wherein the modulation is ligand 
independent repression. 
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43. A method for modulating the expression of one or more genes in a 
subject organism containing an endogenous response element, wherein said response 
element controls expression of one or more genes 

said method comprising introducing a chimeric protein according to 
5 claim 1 to the subject that interacts with said response element, thereby 

modulating expression of the gene(s) independent of the presence of ligand for 
the chimeric protein. 

44. The method according to claim 43 wherein the chimeric protein is 
encoded by an inducible DNA construct and the modulating comprises inducing 
expression of the gene(s). 

45. A method for modulating the expression of one or more genes in a 
subject organism containing an endogenous response element controlling expression 
of one or more genes, 

said method comprising introducing to the subject a chimeric protein 
5 according to claim 1 that interacts with the response element, thereby 

modulating expression of the gene(s) dependent on the presence of 
endogenous ligand therefor. 

46. The method according to claim 45 wherein the chimeric protein is 
encoded by an inducible DNA construct and the modulating comprises inducing 
expression of the gene(s). 
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47. A method for modulating the expression of one or more genes in a 
subject organism containing: 

1) a chimeric protein according to claim 1, and 

2) an endogenous response element controlling expression of the 
one or more genes, wherein the chimeric protein interacts with the response 
element, 

said method comprising introducing to the subject an exogenous ligand 
for the chimeric protein, thereby modulating expression of the gene(s) 
dependent on the presence of the exogenous ligand. 



48. A method for modulating the expression of one or more genes in a 
subject organism containing: 

1) a chimeric protein according to claim 1, and 

2) an endogenous response element controlling expression of the 
one or more genes, wherein the chimeric protein interacts with the response 
element, 

said method comprising introducing to the subject an exogenous ligand 
for the chimeric protein, thereby modulating expression of the gene(s) 
dependent on the presence of the exogenous ligand. 

49. A method for modulating the expression of one or more genes in a 
subject organism containing: 

a chimeric protein according to claim 1, and 
an exogenous ligand for the chimeric protein, 
said method comprising introducing to the subject an endogenous 
response element controlling expression of the one or more genes, wherein the 
chimeric protein interacts with the response element, thereby modulating 
expression of the gene(s) dependent on the presence of the exogenous ligand. 
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50. A method for modulating the expression of one or more genes in a 
subject organism containing a chimeric protein according to claim 1, 

wherein said method comprises introducing to the subject an 
exogenous response element controlling expression of the one or more genes, 
5 wherein the response element interacts with the chimeric protein 

thereby modulating expression of the gene(s) independent of the presence of 
ligand for the chimeric protein, 

51. A method for modulating the expression of one or more genes in a 
10 subject organism containing an exogenous response element controlling 

expression of the one or more genes, 

said method comprising introducing to the subject a chimeric protein 
according to claim 1 that interacts with the response element, thereby 
modulating expression of the gene(s) independent of the presence of ligand for 
15 the chimeric protein. 

52. An isolated protein crystal suitable for x-ray diffraction analysis 
comprising a purified chimeric protein according to claim 1. 

53. The protein crystal according to claim 52 further comprising a ligand 
bound to the purified chimeric protein so as to form a chimeric protein-ligand 
complex. 

54. The protein crystal according to claim 53 further comprising a nucleic 
acid construct being a putative response element for the complex. 

55. A set of x-ray diffraction crystal coordinates obtained by x-ray 
diffraction of the isolated protein crystal according to claim 52. 

56. A set of x-ray diffraction crystal coordinates obtained by x-ray 
diffi-action of the protein crystal according to claim 54. 
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57. A method for identifying a potential ligand for a member of the 
steroid/thyroid hormone receptor superfamily, said method comprising: 

creating a three-dimensional structure of a chimeric protein as defined 
by the x-ray diffraction coordinates according to claim 55, 

employing said three-dimensional structure to design or select the 
potential ligand; 

synthesizing the potential ligand; and 

contacting the potential ligand with the chimeric protein in the 
presence of the response element with which the chimeric protein interacts 
operatively linked to a marker gene under conditions suitable for causing 
expression of the marker gene to determine the ability of said potential ligand 
to transactivate expression of the marker gene. 

58. A method for identifying compounds that modulate formation of a 
functional entity in a cell containing: 

a chimeric protein according to claim 1, and 
a response element with which the chimeric protein interacts 
operatively linked to a marker protein, 

said method comprising contacting the cell with a test compoimd vmder 
conditions suitable to cause the chimeric protein to transactivate expression of 
the marker gene, and 

determining the amount of the marker protein produced as compared 
with the amount produced in the absence of the test compound, 

wherein a difference in the amount of marker gene expressed indicates 
a modulation of formation of the functional entity due to the presence of the 
test compound. 

59. The method according to claim 58 wherein the amount of marker 
protein expressed is increased, indicating that the test compound facilitates formation 
of the functional entity. 
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60. The method according to claim 58 wherein the amount of marker 
protein expressed is decreased, indicating that the test compound represses formation 
of the functional entity. 
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ABSTRACT 

The invention provides chimeric proteins having at least two functional 
protein units, each containing the dimerization domain of a member of the 
steroid/thyroid hormone nuclear receptor superfamily. The chimeric proteins can fold 
vmder crystallization conditions to form functional entities. The functional entities 
optionally contain a novel flexible peptide linker of variable lengths between at least 
two of the protein units. In a preferred embodiment, the linker is designed to be 
increased in increments of 12 amino acids each to aid in preparation of variant 
chimeric proteins The DNA binding characteristics of the invention functional 
entities differ from those of wild-type complexes formed between "monomeric" 
receptors and their binding partners. Some functional entities, e.g. dimers expressed 
as fusion proteins, transactivate responsive promoters in a maimer similar to wild-type 
complexes, while others do not promote transactivation and function instead 
essentially as constitutive repressors. The invention further provides nucleotide 
sequences encoding the invention chimeric proteins, cells containing such nucleotide 
sequences, and methods for using the invention chimeric proteins to modulate 
expression of one or more exogenous genes in a subject organism. In addition, 
isolated protein crystals suitable for x-ray diffraction analysis and methods for 
obtaining putative ligands for the invention chimeric proteins are provided. 
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